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0 . i . This is the first Fascicle of the Languages of the World File — 
an NDEA Office of Education project begun in Washington under the director- 
ship of William R. Parker and Kenneth Mildenberger . The first place that 
the Language File was developed was in the Office of Education, Washington, 
D. C., in response to a practical need. Various administrative and legis- 
lative offices in Washington found it difficult to obtain reliable information 
on the languages of particular countries, as the number of different lang- 
uages and the number of their speakers (language census), and whether 
two different language names represented dialect differences of one lang- 
uage, or a language barrier to each other, and so on. Consultants were 
invited to come to Washington to supply such information within the areas 
of their competence; and the information they gave was placed in a manu- 
script Language File. 

This Language File was then moved from the Office of Education to 
George Washington University in Washington, D. C. where it was developed 
along the same lines, and in addition employed a small staff which excerpted 
external information from published stources, especially information on 
language census. This permitted the compilation of a list of languages hav- 
ing over a million speakers, subsequently published with a preface by the 
principal investigators of the present Fascicle who pointed out that a census 
giving the number of speakers ascribed to language names in the published 
sources could be misleading (Languages Now Spoken by Over a Million 
Speakers, AL 3.8.13-2E, 1961). For example, in the published list the 
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maximum number of spe-kers for any one language in sj)uth Asia is as- 
cribed to Bengali (75 million), since Hindi is listed separately from Urdu: 
but the combined number of speakers of the one language, Hindi- Urdu 
(135 million), far exceeds the number of speakers of Bengali. A reliable 
language census is possible only when external information about the lang- 
uages, is combined with internal information on the languages themselves. 

Finally, the Language File was moved from Washington, D. C. 
to Indiana University, which continued its development in the following 
fashion. First the aid of Corresponding Contributors was elicited — 
specialists were asked to supply bibliography relevant to Sino-Tibetan 
languages, for example, as well as to bibliographic sources for such 
information. Second, a staff of about a dozen advanced graduate students 
in linguistics excerpted the published sources and placed the full citations 
and reduced abstracts of what they excerpted under the appropriate rubrics 
in the Language File. Third, the excerptions were organized and summa- 
rized by the principal investigations for critical appraisal and revision by 
th»ss consultants, either at Indiana University or at the university of the 
consultant. 

This report is Englished by the principal investigators who wish to 
make it clear that the report is, in intent, equally authored by all the con- 
sultants. In our original plan, a iin^l typescript of this report was to be 
sent to each consultant for his addenda and corrigenda . Some additional 
material was obtained in this way, and integrated in the report as it now 
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stands. However, changes were also made without approval of all tha con- 
sultants* listed below and In all- cases the integration of correctioiiR were 

« 

made without subsequent perusal and approval of the consultants who contri- 
buted the correction. It would seem improper to imply formal responsibility 
of joint authors to< consultants who did not have time to examine the final 
version. of this report as it now stands; ^nd though we were willing to give 
consultants all the time that was needed, the terms of our contract called 
for completion of our contract on this and following fascicles by a date which 
Could not be further extended. Hence, we list the consultants of FASCICLE 
ONE here instead of under the title of FASCICLE ONE, as was our original 
intention. The readers of this report as well as the principal investigators 
(C . F. and F. M. Voegelin) have the enormous benefit of co- authorship by 
the following consultants: Y. R. Chao, of the University rf California, 
Berkeley; Nicholas Bodman, of Cornell University, who reviewed material 
presented here but was mainly concerned with materials to appear in sub- 
sequent fascicles, as was Denzel Carr of the East- West Center at the 
University of Hawaii, and the University of California, and as was Robert 
Morse who worked at Indiana University but is now re-engaged in field work 
in Burma; Henrietta Chen, ai ^ formerly at Indiana University but now at 
Cornell* University, is. the* so*.*'* e. of what we call 'acquired Mandarin^and 
also the informant for the Thai sentences which appear in a following fascicle. 
As matters stands this is a very preliminary rather than a polished publication 
on the state and scope of Sino"Tibetan. The principal investigators request 
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^specialized scbolars to supply thenn with jcor^Tg^nd^ and addenda which may 
be published, as such, or incorporated in a future revised version of the 
present report. 

Our development of the Language File included excerpting and critical 
consideration not only of external information, as language census data, but 
also of internal information on the languages themselves — partly in order to 
control and thus add to the reliability of external data, partly to make the 
fascicles relevant to the broader interests of administrators , legislators, 
and scholars who are not linguists, but who are practically or theoretically 
concerned with communicational complexities existing in the world today. 

The internal information about the languages themselves is given in 

non- technical observations after illustrative sentences — * observations on 

whether the structure illustrated is productive (generative), i.e. widely 

applicable to other sentences with different selections of topics, comments, 

or phrases, for example; observations on how the structure illustrated is 

S 

like or unlike other areas of the same grammar or similar grammars, as 
Thai in reference to Chinese; and observations about different functions ful- 
filled by a particular structure illustrated. Each illustrative sentence in the 
source language is flanked by English translations * preceded by a free 
translation and followed by a literal translation in which the glosses in 
English follow the order of 'words' in Chinese, for example. 

The set of all sentences for each language or dialect group is then 
followed by a sketch of its sound system . Here we give little more than 
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the value of the letters (phonemes) in which the preceding illustrative sentence 
were written. 

But for the higher grammatical levels, we p’*esent samples from the 
language the illustrative sentences — — which the observant reader may 
understand directly and immediately. Our remarks appended to these 
illustrative sentences are offered in the spirit of observations for the 
observant. And if the obser^'^ant reader, guided by the illustrative sentences, 
anticipates our appended observation, so much the better. In short, this 
presentation is intended to give the reader internal or grammatical infor- 
mation. about Chinese, for example, derived from his ov/n observation of a 
sample of Chinese. 

All this might be called tlie presentation of grammar without linguistics 
but not without benefit from linguistics. We are all indebted to structural 
linguistics for justifying the particular structure of a particular language. 

It is the justification that is hard to follow for the non- linguist — not the 
habits of the speakers of a particular language. Grammars ar 2 justified 
statements about those habits. After all, every Chinese child learns to 
follow the habits of his predecessors in a particular Chinese dialect, without 
linguistic training. But, of course, the child is only an exemplar of such 
habits, and does need linguistic training before he can make valid statements 
about them. When the ethnolinguistic or sociocultural factor is included, then 
language as the product of speakers^ habits appears in an expanded frame of 
reference — the functional fraine that Ferguson has called the language 
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situation in a particular country. 

However, if we . are to begin with the language situation in 
China and go on to the language situation in other countries - in Tibet, and 
South Asia and much of Southeast Asia in which Sino” Tibetan langua.ges are 
spoken - we should encounter so m«*ny unrelated languages that the picture 
of Sino-Tibetan languages would become blurred, and the list of Sino-Tibetan 
languages given in indexes to the present fascicle would have tc be taken out 
of a welter of all the language names that are encountered in the countries in 
which Sino-Tibetan languages are also spoken. We can still refer to these 
other languages - languages not related to Sino-Tibetan but in terms of 
their language family labels (as Dravidian or Indo-European or Malayo- Poly- 
nesian) rather than in terms of their individual language names. are tliere 
enabled to focus on Sino-Tibetan languages, and list them separately as 
individual languages, though in the context of the countries in which they are 
spoken. In successive fascicles, the other languages of the countries will be 
listed separately, with the cumulative result of rbtaining a list of all lang- 
uages for all countries. 

Here we are sufficiently concerned with Sino-Tibetan languages. How 
many are there? Limiting ourselves to language names about which some 
linguistic data is published or is in accessible manuscript form, we obtain 
a list of several hundred language names. How many people speak Sino- 
Tibetan languages and where are they located? The speakers of these lang- 
uages are very unevenly distributed. Most are found in mainland China, 
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whose populaticn has now reached 750 million including non-Chinese 
speakers — according to information vouchsafed t© President Kennedy 
recently (December 17, 1962, televised press conference). In Burma there 
are l6 million speakers of Sino-Tibetan languages, and in other parts of 
Southeast Asia, as Thailand and Laos, such speakers are also counted by 
the millions. In addition, people in northern India (Nepal, Sikkim. Bhutan, 
and northern Burma) , speak numerous Sino-Tibetan languages which average 
about 180 thousand speakers for each language. In terms of density of Sino- 
Tibetan speakers, mainland China is enormously dense, while Tibet a,nd South 
Asia are more sparsely represented. But in Southeast Asia, the number of 
Sinu- Tibetan speakers is again relatively dense, especially when the over- 
seas Chinese are counted with speakers of national languages, as Thai in 
Thailand. The overseas Chinese in Singapore outnumber the native Malayo- 
Polyneaian speakers in Singapore, for example. 

The family tree model is more appropriate for the different language 
families in the Sino-Tibetan phylum than for the phylum itself. Each lang- 
uage family treated here is taken as a constituent of the Sino-Tibetan phylum. 
This and following fascicles list languages in the following nine language 
families: Han Chinese; 2 ^ Miao-Yao; Kam-Thai; ^ Burmese-Lolo; 

5 . Karen; Bodo-Nagai -Kachin; Naga 2 "Chin; ^ Gyarung-Mishmi; 

9 . Tibetan. Each of these nine language families are reconstructed (or 
apparently reconstructible) fpr nine different parent languages. After these 
are actually reconstructed, their relationship to each other can be appreciated 
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in Sino-Tibet phylum linguistics, rather than controverted. We avoid the 
controversy by giving exclusive attention to the languages spoken today in each 
of the constituent language families. That is t® say, we wish to avoid becoming 

entangled in the controversy over competing classifications which we outline m 
the following section (0 . 3 . ) . 

- - - • ^ • There are over 30 million speakers of languages of the Kam-Thai 
family, of whom over 20 million speak languages belonging to the Southwestern 
branch ~ Thai, Lao, Shan, Thai Noir, Paiyi; in addition Maspero (1912, 1952) 
postulated Vietnamese as a member of this family. 

Disturbing questions now obtrude. Does Kam-Thai, for example, belong 
to the Sino-Tibetan family, if Sino-Tibetan is a language family ? Or does Kam- 
Thai belong with Malayo- Polynesian languages? There is an interesting history 
in this controversy over classification. Wilhelm Schmidt's classification includes 
Kam-Thai in Sino-Tibetan and in addition postuliate-s Austric as ancestral to 
Austronesian (Malayo- Polynesian) and to an Austroasiatic phylum (including Munda 
and Mon Khmer) and then postulates a relationship between Sino-Tibetan and 
Austric— thereby making Malayo- Polynesian languages in the Pacific related to 
South, Southeast and East Asian languages. George Grierson's classification 
starts with Sino-Tibetan as the earliest ancestor, and has the daughter languages 
divided into Thai-Chinese languages and Tibeto- Burman languages. P. K. Benedict 
begins with bifurcating branches as does Grierson, but has the daughter languages 
divided into Tibeto-Karen languages and Chinese languages - rather than Thai- 
Chmese. This is because in Benedict's view, the Thai languages are daughter 
languages of another parent language, also called Austric, v/hich also bears an 
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ancestral relation to Mon-Khmer, Vietnamese, and Malayo- Polynesian 
langiages (as well as to his 'Kadai', which we include in Kam-Thai). This 
differs from the first classification noted above chiefly in that two different 
parent languages are postulated by Benedict (Sino-Tibetan and Austric) while 
one parent was previously postulated ancestral to Mon-Khmer and to Malayo- 
Polynesiar languages as well as to Sino-Tibetan, with the latter parental to 
Thai-Chinese. Most subsequent summaries of languages of the world follow 
either the more inclusive classification (e.g., L. H. Gray), or the less 
inclusive Grierson classification (e.g. E. Kieckers); one recent summary 
(Greenberg's ) seems to have followed the least inclusive Benedict classification, 
which does not even include Thai among the Sino-Tibetan languages. The 
fundamental issue here does not hinge on the present-day data (since everyone 
can see the plethora of similarities between Chinese and Thai), but on the inter- 
pretation of these similarities in grammar and lexicon. If the similarities are 
thought to reflect a reconstructible Proto Thai-Chinese, then Sino-Tibetan is 
still ancestral to Thai-Chinese but more distantly so. But if the similarities 
between Chinese and Thai are thought to be a consequence of diffusion between 
the two, then the problem is shifted to areal linguistics, which treats questions 
of borrowing between languages that are not necessarily related. 

This mxght end in controversy if our alternatives were to accept or reject 
Sino-Tibetan as the parent language of the languages variously classified under 
it. Controversy is avoidable, however, if Sino-Tibetan is postulated as the 
non-reconstructed phylum ancestor of a number of language families. 
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each with its own reconstructed (or obviously reconstructible) parent lang- 
uage. No one can quarrel with setting up Sino-Tibetan as a cover term for 
languages that bear some similarity to each other, beyond random similarity. 

If Thai does not belong in this phylum (as Benedict says it does not), it belongs 
in another phylum in the Pacific, with a different phylum ancestor; then it 
might be said (as Schmidt has said or rather implied) that a macro-phylum 
ancestor (Austric) does ultimately relate both the Sino-Tibetan phylum and 
the Thai languages. For further details of the differences between the Schmidt, 
Grierson, and Benedict classification and Shafer’s summary, see the appended 
classification charts. The differences among these classification charts 
suggests that fruitless controversy would follow if we were to regard Sino- 
Tibetan as one language family rather than as a linguistic phylum, which is 
ancestral to many language families. We look for a reconstructed or recon- 
structible parent language for each language family under the phylum. These 
more modest language families then account for the closer relationship between 
groups of languages shown nearer the bottom of each chart in a given family 
nearer the present day differentiation — and the remoter relationship between 
language families, of which there are nine altogether, is postponed as a problem 
of Sino-Tibetan phylum linguistics. 

This is essentially the direction in which both T.H. Tung and Y. R. Chao 
move in their treatment of Sino-Tibetan languages spoken within China and 
in Southeast Asia. They set up what amounts to a Chinese language family, 
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a MiaO“Yao language family, and a Kam-Thai language family; but they lump 
the remaining languages of the Sino-Tibetan phylum into a non- coordinate 
Tibeto- Burman branch of the main phylum. The latter is susceptible to 
language family grouping, just as Tung and Chao have set up Chinese, 
Miao-Yao and Kam-Thai in East and Southeast Asia; and credit belongs to 
Robert Shafer for the first glimmering of this in his attempt to break up 
the non- coordinate Tibeto- E irman branch into coordinate language families. 

The breakthrough in finding order among the Sino' Tibetan languages 
comes with the increased recognition that we are actually dealing with more 
or less distantly related language families, each of which has to be described 
before agreement can be reached in Sino-Tibetan phylum linguistics. The 
family tree model for the Sino-Tibetan phylum given in all previous classifi- 
cations postulates more than is known and hence ends in a list of non- agreeing 
charts, such as those given below — or even includes an American Indian 
language family (Athapascan) . What is known is that there is more similarity 
among Sino-Tibetan language families than among any randomly selected group 
of nine or ten language families in any part of the world. What is not yet 
known is how much of the similarity among Sino-Tibetan language families is 
reconstructible, and how much is a consequence of diffusion, possible because 
of the coexistence of Sino-Tibetan language families in one vast area extending 
from north India and from Southeast Asia through China into Manchuria. 

A set of charts, and also further discussion, of the classification of 
Sino-Tibetan languages, according to Wilhelm Schmidt, according to George 
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Grierson, according to P. K. Benedict, according to the Chinese scholars 
cited and others, and according to Robert Shafer are given in subsequent 



fascicles. 
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Of all large countries, mainland China comes closer than any 
other to being monolithic or megalithic in the habits and appearance and 
numbers of its speakers. It is easier to understate than to overstate this 
aspect of the languages and ipopulations of mainland China. obtain a 
relative view, compare mainland China and India, 

In population China is first in world rank; India is second. There are, 
roughly spealdng, two billion people in all of Asia# Almost half of this vast 
population is Han Chinese. More than one quarter of the two billion people 
of Asia (some 600 million) live in South Asia — that is, in India, Pakistan, 
Ceylon, Nepal, Sikkim and Bhutan; and some geographers also include 
Afghanistan in South Asia. 

Languages in one family of South Asia are related to English (Indo- 
European, hereinafter TE), but these languages, intrusive from Europe and 
West Asia in prehistoric times, have been influenced to sound in some respects 
like languages which were in India before the arrival of IE speakers — at 
least those of the Dravidian and Munda families. Still other language 
families, intrusive from Tibet and China, belong to the Sino-Tibetan phylum. 
And, the total number of different or separate languc^es — ■ each constituting 
a language barrier to the other (whether related in the same family or not) 
is well over a hundred for South Asia, However, in mainland China, the 
vast majority of the 750 million inhabitants speak what we term Han Chinese. 
Some 95 per cent of the 750 million inhabitants speak one or another of the 
Han Chinese languages, and' 70 per cent of the !^Han Chinese speak Mandarin; 
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the remaining 30 per cent speak Wu, Min, Hakka, Cantonese, and Hsiang. 

There are also eleven million Han Chinese speakers in the official Republic 
of China of the United Nations (=Nationalist China=Taiwan=Formosa)«» a 
refugee government on part of its own territory. And additional millions of 
overseas Chinese speak Han Chinese^* mostly in Southeast Asia. 

Within China itself the non-Han speakers are also regarded as being 
non-Chinese. Some speak an unrelated foreign language, as English; many 
speak various tribal languages belonging to other (but only to two other) language 
families in what is sometimes called 'China proper' - Miao-Yao and Kam- 
Thai (cp: following fascicles). Both of these, like Han Chinese, are language 
family constituents of the Sino-Tibetan phylum. 

Taiwan's population consists of a majority of speakers of Southern Min, 
perhaps half of whom can also use Mandarin, plus a minority of native speakers 
of Mandarin and now only a very small sprinkling of speakers of the non-Han 
tribal languages which belong to the Austronesian language family, which is 
not a constituent of the Sino-Tibetan phylum. The island of Hainan** under 
Communist control, though not part of mainland China-*’ probably has fewer 
speakers of Han Chinese (specifically, Southern Min) than of Li which belongs 
to the Kam-Thai family (also a constituent of the Sino-Tibetan phylum)^ be side s 
possibly speakers of Austronesian languages. 

The notion of 'China proper'^ in general area extending south of 
the Great Wall— -is convenient for distinguishing the part of China in which 
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Han Chinese, languages are predominant from the rest of the country which 
may be called China's frontier provinces. The area covered by both 'China 
proper' and the frontier provinces is much greater than that of the United 
States (though 'China proper' alone is somewhat smaller). Less than a 
fourth of the inhabitants who speak a language which is Sino-Tibetan— but 
not Han Chine sei^^ reside in 'China proper'; three-fourths of them reside 
in on^ pr„ anot;]ier o^ CJiina's. froi^tiet* provinces, and in some of these there 
also reside peoples who speak Altaic languages which are neither Han (in 
language family affiliatioh')’ nor Sino-Ti.bstan (in phylum affiliation). 

Thus, there are 5 million Uighurs in Sinldlang province, speaking Altaic 
languages known as Kazak (l«*li) andKhalkha (K* e-tzu-le-su), beside the 
Mohammedan Hui (Ch'ang-chi). There are more Mongol speakers in China' s 
province of Inner Mongolia than there are in the Mongolian Republic. 

Most inhabitants of Tsinghai province and some in Ninghsia province, as 
well as in Manchuria, speak Altaic languages. The remaining frontier 
provinces include many language s which are non -Han Chinese but still 
Sino-Tibetan, as the three Tibetan languages in China's province of Tibet, 
with still other Sino-Tibetan languages represented in most of Yunnan province, 
most of Kwangsi province, and in parts of other provinces (Szechuan, 

Kweichow, and western Hunan). Nor are the Tibetan languages confined to 
the Tibetan province; they are also spoken in adjacent Yunnan where a total 
of 6 million people speak Sino-Tibetan languages that are not Han Chinese. 
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The population of all of South Asia (600 million) approximates that of 
Han Chinese speakers alon^ - variously estimated as 94 or 95 per cent 
of the total population, including speakers of all languages, both in 'China 
proper' and in the frontier provinces. Language census data in recent 
years show steady increase of 15 million or about 2% annually for the total 
population in China. Thus, the total figures published in 1948 give 463 million, 
those in 1953 give almost 583 million, and those in 1962 give 699 million, 
with an even higher total (750 million) given in President Kennedy's news 
release. If we estimate the Han Chinese speakers to be 95 per cent of 
these totals, then their numbers for 1948 are 440 million; those for 1953 
are 554 million; those for 1962 are 664 million, or even more (710 million). 
However, when language census figures are itemized for particular Han 
Chinese languages or dialects, the sum for all Han Chinese speakers falls 
below rather than above 600 million (but some itemizations include more dialect- 
language names than others). We cite an itemization from a Chinese source 
(Lou Cheng-Pei and Lli Shu-Xiang, Yuyan Yanjiu, p. 4, 1956) which,though 
closely similar to a Russian source (G. P. Serdju^enko for 1959), does not 
include the dialect- language name Kan to which the Russian source ascribes 
13 million speakers. The 1956 itemization speaks of dialects rather than 
languages; of the Hakka dialect (with 20 million speakers), of the two Min 
dialects (with 22 million speakers), of Hunan , i.e. , Hsiang (with 26 million 
speakers), of Cantonese (with 27 million speakers), of Wu (with 46 million), 
and of Mandarin (387 million, including Northern, Southern, and Southwestern 



Mandarin) . 



18 



Anthropological Linguistic s. Vol. 6, No. 3 



The monolithic language situation, then, is surely the result of the 
numerical preponderance of Han Chinese speakers in China, but only 
partly so. Several other factors contribute to the same monolithic result, 
despite one factor which would seem to have the opposite effect. That one 
factor is that there is not one Chinese language; rather there are six or 
seven different Chinese languages. In this western sense, Han Chinese is 
a language family. But why do non-western Chinese scholars generally — 
and for good reasons, though not for the reasons adopted in western dia- 
lectology — speak of dialects rather than languages? In the Chinese 
view (as well as in the view of many western Sinologists) Hakka, Min, 

Hsiang, Cantonese, Wu, and Mandarin are dialects of a single Chinese 
language . 

These Han Chinese dialects or languages are unified enthnolinguistically 
if not linguistically. Whether viewed as dialects of one language, or as 
separate languages in one language family, no one can quarrel with calling 
the whole 'Han Chinese'. Han Chinese is about as unified when taken as a 
cultural construct, as when taken as a linguistic construct. 

There are cultural differences in mainland China, such as that 
based on rice subsistence in the South, and that based on wheat subsistence 
in the North; representatives of the latter are distinguished as 'noodle 
eaters' by Chinese students who come to America from South China. How- 
ever, both North and South Chinese are equally Han. Also unifying is the 
stereotype of the Middle Kingdom Man; but this stereotype does not 
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coincide with the referent of Han Chinese when it includes some non-Han 

Miax)'-¥'^P. 

There are linguistic differences of a complete language barrier 
kind between Mandarin and Cantonese, for example, leading to the western 
view of separate languages — but all belonging to the Han Chinese language 
family. We do not treat the linguistic diversity of earlier times, but note 
in passing that linguistic diversity is not confined to modern China, 

It is only in the ethnolinguistic view that the monolithic construct 
of Han Chinese emerges. This is the view that is given below. From a 
superficial point of view, it would appear that Han Chinese as an ethno- 
linguisiic monolith has played into the hands of the Communists who 
succeeded in uniting mainland China, while in South Asia the splintering 
of languages and cultures h»is stood as a buli"/ark against unification. 

But the fact is that the Communists are playing down the unification 
influence of Han Chinese; what we call Han Chinese is called by them 
H^n Yu, to distinguish Han Chinese from Miao-Yao, etc. The Communists* 
implication is that Han Chinese is only one of several tribes and languages 
in the Middle Country; the others, though minority tribes and languages 
of the Middle Country, are then regarded as being ethnically Chinese. 

Writing in Chinese characters began at about the time that the 
Sumerian language became extinct — i.e. about 2000 B. C. Though Chinese 
is not the oldest known kind of writing, it is the oldest writing of its kind 
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that has not become extinct. Other writing systems that were known in 
the Most Ancient East (from Sumeria to Egypt and Minoa) had to be 
deciphered after millenia of disuse; Maya glyphs (characters) in 
Mesoamerica are now being deciphered after centuries of disuse. In 
their continued use of the alphabet -included logographic system, the 
Chinese alone have kept alive a type of writing which no other culture 
was able to keep alive. All writing of this type uses some symbols 
(characters) to specify words or smaller parts of words (morphemes); 
and that is all that is meant by ’logographic’. Chinese characters are 
’logographic' rather than 'ideographic'; they are word-symbols (or more 
exactly, morpheme- syrr*bols) rather than idea- symbols. And they are 
cartain*y no more pictographic than English words are sound— —imitative — 
in both cases, a few are; most are not. 

Chinese newspapers keep several thousand characters on hand. 

The Chinese telegraphic code book is limited to 10 thousand character s. 
(This IS not (^uite as arbitrary a closed corpus as it has sometimes 
been said to be. The telegraphic code provides 10,000 four-digit groups — 
0000 to 9990 — for over nine thousand characters, with a few unused or 
blank spaces; and in addition there are available some graphic variants 
and obsolete characters. ) And the fullest unabridged dictionary, begun in 
the 18th century, gives some 50 thousand characters. Attempts at 
simplifying character writing wiere made by early missionaries and by 
recent Communists. During the EOth century, Chinese scholars have also * 
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used adaptations of our self-sufficient alphabet with its low number of letter 
besides continuing to use the thousands of characters required for writing 
in the alphabet -included logographic system. There are, accordingly, two 
co-existent but not competitive writing systems in modern China: the 
traditional characters used by all literate Chinese, and variants of the 
Latin alphabet as by specialized scholars. The Gwoyen Romatzyh (the 
National Romanization) and the new system used in Mainland China were 
intended to co-exist for a while with and later to supplant the Chinese 
characters as the national script, but have as yet not widely succeeded. 

The RussianCyrillic alphabet is used for writing Chinese by Russian and 
other Soviet scholars in much the same way as romanizations are used 
in the West. 

In alphabetic type, however, the Russian Cyrillic alphabet is the 
same as the Latin alphabet. The latter was stabilized in its full inventory 
of 26 letters less than a thousand years ago, though the history of the 
Greco-Roman-Greco-Russian alphabetic type goes back to the time when 
the Greeks borrowed letters from the Phoenicians who were using a 
different type of alphabet. In Greek and later usage, letters specifying 
consonants appear beside letters specifying vowels. The use of symbols 
to specify sounds is all that is meant by alphabetic writing. 

It is in this sense that the Chinese system of writing is said to be 
an alphabet -included logographic system. The .alphabetic component 
of some characters gives a phonetic hint (sound specification) while 
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the logographic component of the same character gives the lexical or 
morpheme specification. What is surprising to westerners is that the 
vast majority of Chinese characters do give some phonetic information. 

This is so contrary to the western bias ^bout Chinese writing 
that we invoke a fantasy to correct the bias — that the enormous difficulty 
in learning to read thousan<^-and thousands of characters derives from the 
erroneous assuniption that the characters are mostly without any phonetic 
hint as to their pronunciation. Suppose, now, that the Communist 
hegemony in mainland China had not merely threatened, but had persisted 
in its threat to replace the traditional alphabet*^ included logographic 
system with a self-sufficient alphabet which exists beside the former 
but does not now replace it. Suppose, then, that the Chinese characters 
had been entirely abolished in the second half, of this century. Suppose 
further that a twenty-first century committee of Chinese linguists were 
charged with deciphering the alphabet-included logographic writing in one 
library of untimslated text manuscripts; how would they go about it? From 
the point of view of statistical preponderance, the characters which include 
sound specification would be the most important set in the decipherment of 
an interrupted tradition in writing. If the decipherers were to neglect the 
sound- specifictition components in such characters, they would have no 
more chance of success than Sir Arthur Evans had in deciphering Minoan 
writing; it was only when Michael Ventris took the phonetic part of Minoan 
writing as his point of departure that he was able to show that Minoan 
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writing consisted of a juxtaposition of a self-sufficient alphabet {called 
Mycenean Linear B), and logograms (pictographic and quantifying 
logograms). 

But Chinese character writing is not a system which juxtaposes 
a string of self-sufficient alphabetic symbols (specifying sounds) to a 
string of logographic symbols in Minoan-like equations. Rather, it is a 
system which integrates sound symbols and lexical morpheme symbols. 

The overwhelming majority of characters in the Im ^ classes or 
categories of the Chinese lexicographers include sound specification. Two 
clear exceptions to the rule for the majority are the pictographic hsiang 
hsing class of characters, and the easy-to-remember 'simple ideograph' 
or chih shih class (e. g. one line marks one, two lines two, etc. ));but 
both of these classes are numerically negligible. Slightly more 
numerous is the 'compound ideograph' class of characters that more 
nearly fits the western image of subtle Chinese thinking - e. g. the single 
character for military (combining constituents for caught in the middle + 
to stop + arms ). Most numerous are classes of characters with alphabet- 
included constituents - i. e. with components that specify sounds. (But some 

of the latter do double duty simultaneously: some components specify both 

sound and morpheme. ) 

Why then is Chinese character writing not taken as another type 
of alphabetic writing for the most part? Would it not. sim^^lify m^ter^to 
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divide Chinese characters into two separate parts - one part made up of 
logographic classes (as hsiang hsing an d chih shih) versus all the rest 
(those including sound specification in the characters)? No, because the 
alphabet -included part of a Chinese character is always integrated in a single 
composite character with the logographic part; the alphabet -included part 
specifies sounds, but not self-sufficiently so. 

What are called simply 'alphabets' are really self-sufficient 
alphabets. Such alphabets are sometimes used with juxtaposed Chinese -like 
logograms, as the Arabic number 5, or ? for female . And 5 and ? are no 
doubt logograms because they specify words in any language that uses them, 
irrespective of the way the words sound: 5 = five in English but beq^ in 

Turkish. If we were to invert the question asked above in respect to Chinese 
writing (why not separate the alphabet -included part from the logographic 
part?), and address the question to alphabetic writing of any European 
language, as English, the question would be not unreasonable. Why not 
take English writing as a simpler but still Chinese -like type of alphabet - 
included logographic system, since the alphabetic inventory is limited to 
26 letters (not counting calligraphic variants, ac italic, boldface, gothic, 
etc.), and since the inventory of logograms is greater then 26 symbols 
if scientific symbols, such as ^ , are added to the mors widely used Arabic 
numbers? There is one fact that precludes this inversion. Because the 
alphabetic part of English writing - like Mycenean Greek “ is self-sufficient, 
it can dispense entirely with the logographic part (but not vice versa). 
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Hence English writing as a whole is a particular instance of a logograph- 
included (inclu^dable) alphabetic type. 

Is there any hope of developing a more Chine se -like logographic 
system of writing for Indo-European languages? This would liberate the 
writer and reader from mastering the different IE languages, and still 

m 

enable him to communicate in writing across language barriers. The 
sciences in general have attained uniformity in the logographic part 
of their writing. Why not extend this to ordinary writing? Such extension 
would be possible if syntactic uniformity existed among IE languages, at 
least among closely related IE languages. But such uniformity does not 
exist among these languages. That it does not exist is well illustrated by 
Bloomfield's example of the different syntactic and lexical structures of 
four languages for logographic 91 (ninety one in English, one and ninety 

in German, one and half five -times in Danish, four twenties eleven i n 
French). 

Uniformity in syntactic and lexical structure among the different 
languages of the Han Chinese family is sufficiently great to permit any 
given writing in the alphabet-included logographic system to be read in 
the same order, though with different pronunciations, depending on the 
particular language of the Chinese reader. This is what really lies back of 
the unique survival of Chinese character writing over four thousand years 
of use. This also gives good reason and good hope for the continuation of 
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Chinese character writing in the face of recent competition with a soon-to-be 
coexisting alphabet of the self-sufficient type' which reflects the different 
pronunciations among the different Han Chinese languages, and accordingly 
draws attention to the language barriers rather than bridging them. How 
well- or poorly - Chinese character writing bridges the language barriers 
among Han Chinese languages is another question. 

Related to this question is the relative educational cost of learning 
or teaching a self-sufficient alphabet (in which every Han Chinese language 
may be written) versus the cost of Chinese character writing (in which 
every Han Chinese language is written). If something like complete control 
in reading Chinese can be had from recognizing 3000 to 5000 characters, 
and if reading with the help of a dictionary is possible after learning 26 
hundred characters, the relative effort (from a western point of view) in 
learning to read Chinese and learning to read English is greater than a 
hundred to one. But to learn to write or read something directly as; morphem 
(in a composite character with sound specification) is quite different than 
learning the sound specification of a self-sufficient alphabet. To be literate 
in Chinese character writing is to be educated for two reasons (see 1. 2 for 
the second reason). The first reason is that learning to read extensively is 
the same as learning an extensive vocabulary. Not so in English; learning 
the three R's, including writing, does not nec ssarily imply an educated 



individual. 
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There is an alphabetic order in our self-sufficient alphabet, 
which children often chant when learning their ABC's. But no Chinese 
child is taught to chant the sounds of thousands of characters. Instead, 
the modern Han Chinese child is presented with one or both of two pedagogic 
devices. Common vocabulary terms that the child has heard spoken - 
as those for mother , father, come , go - are written in Chinese characters 
on one side of a card, and are illustrated by picture drawn by cartoonists 
on the other side of the card. The second and more sophisticated device 
also uses pictures, but they are arranged facing Chinese characters which 
are simple (requiring few strokes at first) and then, progressively, opposite 
those which are more and more complex. Besides such modern pedagogic 
methods as these, there is a traditional lexicographic classification of 
Chinese characters, discussed above. Both the traditional and the modern 
classifications of characters help in learning (a) how to write Chinese, and 
(b) the lexical resources of Chi^zese; (a) and (b) are learned in one operation 
in China. 

Karlgren conjured up a picture of different Han Chinese speakers 
reading one bulletin board in Shanghai. The Mandarin speaker and the 
Cantonese speaker stand side by side and they can not understand each 
other as they read aloud. But each can understand the single written message. 
To illustrate how this works specifically, we give one possible sentence in 
Chinese characters, and note that three rather than tvro spoken sentences 



are needed to show the closer relationship of Chinese character writing 
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to Mandarin than to Cantonese. 



home is not in America. 

This sentence consists of seven characters (written here left to right). 

reading aloud, the Mandarin speaker would sayi 
t'a te*^ ifia^ pu^ cai"^ mei^ kuo^ 

home not at beautiful country 
Later on, when out of sight of the writing, the Mandarin speaker might 

report what was written to his wife, repeating exactly what he had said 
when he read aloud, . 



In reading the same characters aloud, the Cantonese speaker would 



say: 



t'a 



tek 



in 



pat coi mei kuok 

Later on, when out of sight of the writing, the Cantonese speaker, 
reporting what was written to his wife, would not repeat exactly v/hat he had 
said when he read aloud. Instead, when not directly influenced by the Mandarin 
selection in writing, the Cantonese husband might report: 

m hai mei kuok 



It would be oversimplification to the point of falsification to say that 
the only spoken morpheme appearing unaltered in these three spoken 
sentences is mei^ beautiful (from the description of America, which 



IS ; ^autiful country ^ that the written character for beautiful is just pronounced 

3 

like this (mei ), and hence is especially appropriate in designating America, 
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which in English also has m as its first consonant, as it does in Chinese; 
that Chinese in addition combines the sound with a symbol — * a pleasant 
meaning ( beautiful ) . The more realistic way of stating this takes cognizance 
of how mei"^ is combined with other morphemes in modern Chinese. Thus, 
mei"^ is often used as a transliteration character for syllables of the [me] 
type. As a morpheme, mei- is bound in most cases, and is less frequently 
used when the message beautiful is intended than is the compound /hau^ 
k*an/ good looking . Accordingly, /mei^ kuo^/ scarely evokes the conno- 
tation of beautiful country when said or heard. 

1*2. All scholars in old China, and most scholars in modern China 
would expand, by one step, our preliminary identification (1.1, above) of 
what it takes to be educated in Chinese culture. 'To be literate in Chinese 
character writing' is only the first step — a step which has to be taken in 
order to learn the contemporary classical language known as Wenyen 
(= Wenli), as well as to read Han Chinese; hence Chinese character writing 
serves a dual function. 

An educated Chinese, then, uses character writing not only for writing 
and reading his own native language, but also for reading and writing Wenyen 
in effect an additional language. This is one aspect of multilingualism that 
continues in modern China. The other aspect of modern multilingualism is 
the increased spread of 'acquired Mandarin' for the vast majority of 
speakers — namely, those whose mother tongue is any other dialect than 
Peking Mandarin. Thus, a person born and educated extensively 
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in Shanghai will (a) speak Wu as his mother tongue, (b) be able to make 
himself understood, with some help from character writing, in his own var- 
iant 'acquired Mandarin' (1.3), and (c) be able to use the same character 
writing for enjoying Wenyen. 

There are important differences between Wenyen as a contemporary, 
classical or literary language in respect to any Han Chinese language, and 
Greek or Latin as remote classical languages in respect to any other Euro- 
pean language. To form scientific neologisms in English (e.g. 'cybernetics'), 
we generally borrow from the lexical resources of one or both of our remote 
classical languages, but do not expect educated people in general — much 
less literary artists — to begin to control them as do specialists ('classical 
scholars’); Shakespeare was said to know little Latin and less Greek. Educated 
people who are literate in any Han Chinese language have become so partly 
in order to be able to read and control the massive corpus of Wenyen which 
encapsulates, with the utmost terseness, a corpus of wisdom, as well as a 
lexicon that both overlaps and supplements dictionaries in the Han Chinese 
languages, and a structure that differs from all of them, but more from 
Mandarin, for example, then from Cantonese. Those who do not read are 
not entirely excluded from the Wenyen literature. Blind beggars can chant 
or quote such fragments as they have heard and memorized. Though speakers 
conversations only in one of the Han Chinese languages, they may 
interlard an expression from Wenyen; or compose such expressions, for the 
sake of their brevity, in official government and legal documents, in business 
letters, often in private letters and in some news dispatches, and in virtually 
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all newspaper advertisements and telegraph messages. 

In contrast to the flamboyant style in which many of our advertisements 
are written, the use of Wenyen enables an advertiser to save space, or say 
more in the same space. Many words which require two characters in Han 
Chinese can be written with one character in Wenyen. 

The cryptic effect of saving time and effort (and cost per word) that 
comes from ellipses of words in our telegraphic style is not entirely 
confined to telegrams in Western culture. It has been extended to writing 
some parts of botanical reports, for example. And the anthropologist 
Kroeber once experimented in publishing one of the California ethnographic 
surveys in telegraphic style. But such extension beyond telegrams seems 
to lack elegance, if not grammaticalness; Kroeber, for example, had to 
abandon his experiment in subsequent survey reports. 

Chinese telegrams written in Wenyen achieve (a) greater compactness 

than is obtained in our telegraphic style, and (b) bear the flavor of an elegant 
literary tradition. 

There are associations other than literary elegance which are conducive 

V 

to the use of Wenyen as an on-going contemporary classic. Political mottos 
and slogans in Wenyen used to have an aura of being more worthy of respect 
than they would have been if they were given in any Han Chinese language. 

Sun Yat-sen's political mottos werein Wenyen. For any motto to be effective, 
the shorter the better, if the connotation is precise; this condition is met 
by Wenyen, but also by contracted Han Chinese. Since such leaders of 



Anthropological Linguistics, Vol. 6, No. 3 



Communist China as Mao and Chou En*lai are counted as scholars, they 
would be able to quote from Wenyea. That one cannot be a good orator without 
quoting from Wenyen literature reflects an obsolete cultural value; knowledge 
of Wenyen is still respected, though exhibition of this knowledge has been 
said to have become politically inadvisable. Nevertheless, it is known that 
Mi is able to write elegant Wenyen poems; and that he still does write them. 

About a generation ago, the serious study of Wenyen was begun by 
children still in grade school (e.g. in the seventh grade). Today Wenyen 
is studied by advanced studentsin universities. Whether or not this will 
transform Wenyen from a contemporary classic to a remote classic (like 
our Latin and Greek) must remain an open question. 

The literary legacy from Confucius (6th century B, C. ) to the end 
of the Ching dynasty (1644-1912) was written exclusively in Wenyen. Modern 
Chinese scholars learn to compose in Wenyen after the model of writers whose 
style differed from age to age. But in the Chinese apperception this corpus 
is so unified in structure and lexical selection that two instances from it — - 
as vddely separated historically as a century ago and over two millenia ago — - 
are regarded as equailly representative of Wenyen. 

The structure of Wenyen and of Han Chinese in general is sufficiently 
different that the two would not be confut, For example, one of our Chinese 
students was writing to her mother about clothing; she was asked whether 
she could save space by using only one syllable (either yi^ - or - fu^) — as 



in Wenyen — or whetl^er she would write both as a, redundant compound for 



Sino-Tibetan Fascicle One 



33 



clothing (yi - fu ), Her response was that her mother would be shocked 

if she should write one character (one syllable, either yi^ or fu^) for 

clothing ; her mother would say something like, "Oh, my daughter has lost 

her politeness and her education. " She might think that her daugliter had 

1 2 

forgotten how to say yi • fu , and was writing one or another because she 
was talking that way — ungrammatical in Han Chinese, but grammatical in 
Wenyen. The mother would never suppose that her daughter was attempting 
to write in Wenyen, since the rest of the letter in question was written in 
Mandarin. 

Derk Bodle says it is necessary to "write only the single character 
» meaning ’clothing’, or the single character fu, meaning the same" — 
but this holds for Wenyen, not for writing in any one of the Han Chinese 
languages. China's cultural tradition becomes blurred when the distinction 
between the Wenyen literary language and Han Chinese is not made: "The 
spoken language avoids such ambiguities ['syllables pronounced identically 
but having different meanings, . . , however, by combining two syllables, 

similar in meaning, into a dissyllabic compound, and lettiag the compomid 
stand for the idea which is common to both. . . yd and fu, for example, both 
mean 'clothing' (in addition to many other things); hence the spoken word 
for 'clothing' becomes yi-fu, a compound which will not readily be confused 
by the ear for any other compound. . . it is necessary to write only the single 
character yi, meaning 'clothing', or the single, character fu, meaning the 
same, but not both of them put together (unless, of course, one were to 
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write exactly as one would speak, but this, as we shall see, was not 
customary in the old China). ” But it is customary in modern China, to 
write redundant compounds like yi^ - fu^ for clothing " clothing as one speaks 
them, unless one is composing in the Wenyen the literary language which 
differs from the language of any Chinese speech community. Since 1917 when 
Hu Shih persuaded people to write in plain talk (pai hua), %the language of the 
community, most scientific writing, editorials, novels, dramas and love 
songs have been composed in written vernacular ( 1. 3) . 

1. 3. The double linguistic directive of Communist China 
amounts to this: (a) write as you speak ^i.e. not in Wenyen); (b) speak 
Mandarin (’acquired Mandarin’, if your mother tongue is some other Han 
Chinese language). The two parts of this directive are closely related; it 
would not do to say generally (a) ’write as you ^peak’, if you speak only 
Cantonese; nor is it enough to say (b) ’ speak Mandarin’, for there are three 
or four different major groups of Mandarin dialects — though these are, for 
the most part, mutually intelligible. Hence the expanded ^rective for (b) 
does not include all of the Mandarin language but only that part of Mandarin 
called / p’u^ t’un^ hua^/ common language (p’u^ t’un^ hua^ is synonymous 
with kuo u national language ) . 

When so restricted, the common national language turns out t®be 
a mother tongue for the relatively few (a slight variant of how people 
speak who are brought up in Peking), while the vast majority face the te.sk 
of learning ’acquired Mandarin’, even if they are speakers of non-Peknig 
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;ts. XX wiey are native speaKers oi a language far away in 
the south of China, the aim is not to acquire just any Mandarin dialect; the 
model of ’acquired Mandarin' is always the Peking dialect of Mandarin. 



Speakers who already know some other dialect of Mandarin would more closely 
approximate this model to begin with and would, therefore, have a much 
easier time acquiring 'acquired Mandarin' than would speakers of other 
Han Chinese languages. 

Still, we know of no instance in which an educated speaker of one of 
the other languages has failed in an attempt to learn 'acquired Mandarin' to 
the point of becoming more or less intelligible to a Peldng Mandarin speaker 
(though rarely without showing traces of his non-Mandarin native language, 
or even of his non- Peking dialect of Mandarin). The speaker of 'acquired 
Mandarin', if educated, can use character writing that he and the Peking 
Mandarin speaker share as a crutch in communication; either by writing in 
sand, in dust, on paper or on the blackboard; or by tracing the characters 
on the palm of one hand with the finger of the other; or by tracing the 
characters on a table with a chopstick dipped in soup; or even by tracing the 
characters in thin air. 

In the second half of the 20th century, the speakers of a half dozen 
languages in Mainland China are again faced with accepting — or 
rejecting — virtually the same politically sanctioned Mandarin dialect 
that was in political favor in the last dynasty of China (1644-1912). The 
foreign Manchus (Mongols) required officials from all the provinces to 
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people as language of the officials (kuan hua ). It functioned then as a 
lingua franca , while today there is apparent intent to have it replace the 
other Han Chinese languages. This may be regarded as merely programmatic, 
but it seems probable that no cultural shock would be felt in modern China by 
official insistence on the use of one dialect of one language — whether called 
a common language or national language (or as formerly, language of the 
officials); and it seems equally probable that a culture which has had much 
experience in official pressure of this kind has also learned how to preserve 
its languages and its literature. 

A politically sanctioned dialect of Mandarin has for long been one of 
four supports of the ethnolinguistic monolith here called Han Chinese. Of 
two older supports of this ethnolinguistic monolith, one (Chinese character 
writing) is now competing with a self-sufficient alphabet, while the other 
(Wenyen) is being transformed from a contemporary classic to a remote 
classic. In comparison with ’acquired Mandarin', character writing, and 



Wenyen, the fourth support of the ethnolinguistic monolith is the strongest of 
all — * the syntactic uniformity which lies behind the phonological and lexical 
diversity in a half dozen different languages, each made up of mutually 
intelligible dialects. 

Chinese who are able to read the Wenyen literature with understanding 
(1. 2^ above) have necessarily mastered Chinese character writing (1, ^ i above) 
since the corpus of Wenyen is written exclusively in characters. Such Wenyen, 
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when read aloud, is pronounced as the characters are pronounced by a speaker 
in his own language today, and no attempt is made to reconstruct former 
speech. Accordingly, the same Wenyen passage is pronounced differently 
by a speaker from Shanghai than by one from Peking. A given Wenyen passage 
may be intelligible in written form (character writing) but not intelligible in 
spoken form. If such a passage is unintelligible when pronounced, it would 
also be unintelligible if the pronunciation were written alphabetically (since 
all that alphabetic writing does is to specify sounds of pronunciation). To be 
intelligible, such a passage in Wenyen has to be either (a) written in character 
writing, or (b) translated from Wenyen into Wu or Mandarin, after which 
It can be written in alphabetic writing and still remain intelligible. Alphabetic 
writing and character writing are both possible for Wu, Mandarin, and other 
Han Chinese languages. Alphabetic writing is also suitable for Wenyen. Two 
Belgian. Jesuits in their Romanigation Inte r diale ctique , based on Karlgren’s 
reconstructed Ancient: Chinese, set up an alphabet (26 letters) which could 
write Wenyen or any Han Chinese language. If character writing were wholly 
given up in favor of alphabetic writing, Wenyen cot-ld still function as a con- 
temporary classic language. Nor would Wu, Cantonese, Mandarin and the 
others suffer if written in romanization rather than in characters. Each 
symbol of the latter (character) is isomorphic with an alphabetic symbol 
sequence (syllable), by and large. However, since the shapes of the syllables 
are sufficiently different among the different Han Chinese languages to 
constitute a language barrier between them, it is necessary to say which 
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Chinese language is being written when the writing is in any romanization 
(or any other adaptation of the self-sufficient a.lphabet)« 

It is also necessary to distinguish between strings of syllables which 
represent written vernacular, and those which represent the actual talking 
of a Han Chinese language •— talking not under the influence of Wenyen 
character writing or western educated bilinguals. The use of romanization 
is often but erroneously thought to represent exactly the way a Chinese 
language is spoken. The effective innovator of modern Chinese writing, Hu 
Shih, used to say that his writings in the vernacular were not, nor intended 
to be, in the form of actual speech ('only Chao writes the real spoken lang- 
uage'). The actual speech, the real spoken language is hereinafter called 

sSszfS vernacular represented in romanization or in 

will be called written vernacular. 

It is interesting to note that Hu Shih played up the use of /pai hua/, 
but played down the use of romanization. Because Hu Shih expected to write 
/pai hua/ in characters for some decades, he felt no urgency in facing the 
knotty problems involved in romanizing Chinese languages and dialects. 
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— acquired Mandarin', as identified above, the written vernacula r 
g ives an overall impression of being somewhat more contracted of utilizing 
fewer and sometimes different syllables - than the spoken language . Both 
share the same general structure; neither permits utterances quite as terse 
as those possible in Wenyen which is structurally more different from either 

written or spoken Mandarin than Mandarin is from other modern Chinese 
languages. 

A speaker of 'acquired Mandarin' might say, 

.X .^£2, wJ^itin^ fbis letter in a slap - dash manner , 

(1) uo^ ma^ xnaP hu^ hu^ te^ Sie^ 

QJa [( horse j: jiorse ) (tiger ± tiger ) -Iv 1|^ [write 1 [this 

fe^ ^ §in^ 

classifier letter!.. 

— — Jd 

The literal translation of all sentences, here and below, matches one English 
word (the gloss) for each Chinese morpheme — and Chinese is said to be 
monosyllabic because most morphemes are exactly one syllable in length, 

Thus, one phrase, above, includes one morpheme Sie^ which is glossed as 
[wr^]. To mark the past, an additional morpheme would he included in the 
phrase, Sie le , and the phrase would be glossed [write perfective], 
reflected in free translation as .1 h^ written etc. In a compound some 
monosyllabic morphemes have literal glosses which only hint at or imply the 
referent for the compound as a whole. Others are not paradoxical, are quite 
explicit and even redundant, and hence shov/ a better semantic fit with the 
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meaning of the compound as a whole. The corresponding literal glosses of 
members of a compound are enclosed in parentheses. So also are the glosses 
of the repeated morphemes in reduplication. And a combined compound and 
reduplication is also enclosed in parentheses, as in sentence (1), where the 
literal glosses are ( horse + horse ) ( tiger + tiger); the refer’ent for this re- 
duplicated compound is explicitly stated in the free translation which precedes 
sentence (1): in a slap - dash manner . 

♦ • . * 

Though the speaker would certainly say sentence (1) in s poken languag e - 
and be always understood by anyone who comprehends any Mandarin dialect- 
he would be more apt to substitute ( confused - cursive) for ( horse + horse) 
( tiger ± tiger ) in written vernacular , as is shown in sentence (2), 

I am writing this letter careless ly. 

3 2 3 O 3 ^1^4 

(2) uo liau c* au te ; sie fer} sin 

Cl] [( confused-cur sive) - lyj ^ [write] ^ [this classifier letter ] ^ 

2L 

Sentence (1) is uttered in ten syllables; sentence (2) is written in eight. The 
single difference lies in the spoken language selection of a reduplicated 
compound for (1) — ( horse + horse ) (tiger + tiger); and in the written langua ge 
selection of a simple or non- re duplicated compound for (2) — ( confused- 
cursive) . But, of course, sentence (1) can be written (as indeed is apparent;, 
above); and sentence (2) can be uttered. But sentence (2), when uttered, 
would be less widely understood by Mandarin speakers than sentence (1), 

The first use of enclosing one or mere glosses in brackets (andxabel- 
ling each bracket with a subscript letter), is to permit comparison between 
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closely similar sentences. Thus, sentences (1) and (2), above, are identical 
in brackets a, c, d. Thi^y differ only in bracket b. 

The second use of ei:closing one or more glosses in brackets is that it 
permits easy exposition of where the corresponding Chinese groups of mor- 
phemes are reorderable either in non-contrastive transformations of the 
same message, or contrastive transformations which change the message. 
There is an old stereotyped explanation, now generally abandoned, according 
to which some languages have more grammar than others. Languages which 
permit free reordering of any units, as in the non-contrastive syntax of 
Chinese, were supposed to have less grammar than, say, inflectional Ian- - 
guages which have fixed limitations on the distribution of affixes flanking stems 
in word morphology. This stereotyped explanation was abandoned once the 
20th century notion of levels within the grammar of a particular language was 
developed. In general, it was then clear that languages might differ in being 
grammatically more limiting at one level than another, rather than having 
more grammar in the whole of one language and less grammar in the whole 
of another language. But the older view of Chinese can be restated in terms 
of the new recognition of levels; Chinese shows more grammar (limitations 
on distributions) at some levels, and less grammar at ether levels (hence 
more freedom, or choice, or possibility of reordering without contrast). 

4 

Since brabketS as units are often reorderable (i.e. permutable), without 
change in message, it is at this permutant level that restrictions on distribu- 
tion are least in evidence. Thus in sentence (3), which follows, there are a 
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half-dozen brackets; if now we hold the first bracket and the last bracket in 
constant position, it is possible to vary the positions of the four intervening 
brackets, labelled with subscripts from a to d, with the general message 
remaining constant. 

He has never played happily at home with his own children . 

(3) t*a^ pu^c'eg^ kau^ kau^ §i^^ Si^^ qai^ ^?ia^ 

FheL fnot eve rlu [(high 4 high)-6spirit + spirit) -Ivl [at home 

3 t I 1..2 ,0 0 2*3 

x-.cu;) . Ken hai c-t men uan sua 

middle ] [with (child - formative ) plural ]^ [ (play juggle )lj. 

The referent for the reduplicated compound of bracket c is happy ; the 

21 £ 

ordering of the intervening brackets is [he ] bed e [( played )] . There are 
some half dozen possible permutations in the reordering of the four interven- 
ing brackets, namely; 

a £ 



(3.1) [he ] b d c e [( blayed )] 

(3.2) [he ] b e c d [( played )] 

(3.3) [he ] d b c e [( played )] 

(3.4) [he ] d e b c [( ittayed )] 

(3.5) [he ] e b c d [( played )] 

(3.6) [he ] e d b c [ (played )] 

(3.7) [he ] b e d c [( played )] 



Though these reorderings are possible, they are not selected at random. 



nor arbitrarily, nor by whim of the speaker. They exemplify the principle 
that the minimum syntactic span is more than one sentence in length, for the 
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orderings in (3.1) to(3.7 ) are all resoonse sentences, i.e., sentences uttered 
in response to a stimulus sentence which has its intervening brackets in an 
order parallel to that given in the response sentence. The general principle 
is; Say unto others as you have heard others say to you. This is an extension 
of Chao's rule of question-answer word [i.e ., bracket ] order; 'Ask as you 
would be answered. ' That is, whatever the ordering of brackets in a question 
(or other kind of stimulus sentence), a parallel order will be given in the 
answer (or other kind of response sentence). 

Thus, if someone says (stimulus sentence — here given in terms of its 
glosses in brackets), [h^]^ [has always 1 [at ( schoo l) ini [(happy)-lvl 

U Q Q 

[^ith ( student ) plural ]^ [( played )]j; you select the b d ^ e ordering of sen- 
tence (3) brackets (response sentence (3,1)); [1^]^ [not ever l^ [aj home mid - 
^Id [with (child ) plural 1 [( played )1. However, if someone 

^ 6 I 

says (stimulus sentence), [jie [not ever ] [with (wi f e )1 [at home middle] 

[(happy )-ly [ (chats )]j, you select the parallel b e d c ordering of sentence 

(3) brackets (response sentence (3.7)); [1^] [not ever], [with (child) plural] 

[at home middle 1 [ (happy •[■ tIy] [ (played )1 . And SO on, for the other re- 

Q ^ e 

sponse sentences with still different orderings of intervening brackets. 
Accordingly, the possibility of reordering without change of denotation in the 
message appears to be not entirely free, since it — and »=»tress and intonation 
as well — parallels the choice in ordering of the preceding speaker. But it 
is relatively free. Many alternatives are possible; but only one alternative 
in the response — the alternate which echoes order, stress, and intonation of 
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the . stimulus i?j polite .as well as correct. 

The possibility of reordering brackets with change of message also 
exists. This second kind of reordering is relatively limited, and its restric- 
tions are not influenced by the parallel syntax of the preceding speaker. It 
is in fact much like functional word order in English (e. g. , [tiger 1 is subject, 
or topic before the predicate or comlnent [killed the man], while [man] is 
subject or topic when preceding the predicate or comment [killed the tiger ]). 
But topic is not alway.s translatable as English subject, a? I comment — unlike 
English predicate ‘—may include embedded topics . Also, the topic - comment 
level is explicable in a syntactic span that is longer than one sentence. We 
return to the topic - comment level after considering lower levels — shorter 
spans of morphemes within the brackets. 

It is the bracket as a whole that is reorderable and, therefore, some- 
times analogous to a 'linguistic word* in languages with a 'word' level. The 
single morphemes within a bracket, however, might be said to represent 
'ethnolinguistic words*, since children are taught to learn and tc write single 
morphemes — separate characters in the traditional writing system — and 
adults pay fcr telegrams according to the number of single morphemes rather 
than according to the number of brackets in a message. There is a Chinese 
label for the single morpheme (equivalent to the single character ) — — 

out no label for the bracket, our reorderable unit, The reorderable bracket 
•and the single morpheme coincide when a bracket includes just one morpheme, 
as in «he first bracket oi each sentence given above, and as in >>racket in 
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sentences (i) and (2). All other brackets in sentences (1), (2) and (3) in- 
clude more than one morpheme. 

Literal glosses are given for the single morphemes, even when in paren> 
theses; • but the referent or connotation of the reduplication and/or compound 
in parentheses is given ir free translation, as in the last bracket in sentence 
^ ^ ^ JUfigle)] meaning play or played . 

In text or discourse frequency, as would be expected, the morphemes 
that always function as lexical or major morphemes (capital M, in our nota- 
tion) are most conspicuous. Scarcely more than one in five — and fewer in 
inventory count than in text .frequency - function as grammatical or minor 
morphemes (small m, in our notation), analogous to functive words and affixes 
(suffixes or prefixes or infixes) in languages with a word level. But, most 
surprising of all to Western eyes, a very large fraction of single morphemes 
function alternatively as minor morphemes in some brackets and as major 

morphemes in other brackets (j^? ~ m* notation, as exemplified in 

sentence (4), below) . 

It becomes immediately clear, when applied to English examples, how 
the notation works when lexical or major morphemes are represented by M, 
when morphemes in the grammatical inventory (mostly suffixes) are repre- 
sented by m, and when morphemes which function alternatively as either — 
but in different sequences - are marked by M . ^hus English repeatedly 
represents a sequence M-m-m, since a lexical or major morpheme, repeat 
(M) is followed by -m-m, two minor morphemes which are always suffixes. 
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*Once a suffix always a suffix' is a rule which holds generally for 
English, but some few suffixes(minor morphemes) appear also as lexical or 
mcfcjor morphemes (or vice versa). Notation used for instances of such over” 
lap, is when the morpheme is functioning as a suffix, but ^ when the same 
morpheme is functioning as a lexical or major morpheme. Thus, in full of 
beans and less than half , both full and less are functioning as major mor“ 
phemes. However, we represent both as ^ because both full and less may 
also function as minor morphemes, as in carefully , carelessly . Here - full - 
and ” I e s s ” are represented as ^ and the sequence as 9- whole is represented 
as M-j^-m. Occasional parallel instances of ^ ~ ^ found when a suffix 
or sequences of suffixes are uttered as words, generally as technical words — 

isms from socialism and the like, emic and etic from phonemic , pho " 
netic and the like . 

The rule, once a suffix always a suffix' is a general rule for English, 
but not for Chinese. The total inventory for minor morphemes m's which 
always function as grammatical or minor morphemes — is surprisingly 
small. The total inventory of major morphemes — M's which always function 
as major morphemes forme the largest fraction of lexical morphemes in 
the dictionary. But hundreds of morphemes belong equally to (a) the gram® 
i:nar — —when they serve a grammatical function; and to (b) the dictionary 
— when they function like other lexical or major morphemes — that is, like 
the majority of M's. 

It is only a Western bias to say 'a language should not be confused with 
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its dictionary* or that * the dictionary of a language is an appendix to its 
grammar*. In the Chinese view, all instances of ^ M belong simultane- 
ously to the grammar and the dictionary. And instance., of - M are numer- 

M m 

ous and highly recurrent in any text. Instances of m*s (minor or grammatical 
morphemes which keep their grammatical function in all occurrences) are 
highly recurrent, to be sure, but not numerous in inventory count. So far 
as our sample is predictive for 'acquired Mandarin', it suggests that almost half of lj 
occurrences of morphemes in texts belong simultaneously to the grammar or 
minor morpheme inventory and to the major morpheme inventory ( ^ ) , where 
the meaning is generally altered. 

Thus, ferj^ is glossed as classifier relevant to envelopes or small 
packages, as in sentences (1) and (2) above, and (4a) below, when it func- 

tions as gj. But when functioning as M, fep* is glossed as seal or sealer as 
in sentence (4) , 

He has sealed that letter . 



(4a) 


t'a^ 


fer,^ 


le° 


.4 

nei 




ffin^ 






he 


seal perfective 


that 


classifier letter 






[MJa 


[M 


™lb 


m 


i 


Mlc 




The 


eiiveloDe 


^ is on the desk. 










(4b) 


5in^ 


fer^^ 


.4 

cai 


Ty. 1 

cuo 


5a^^ 








(letter 


sealer) 


at 


t9,ble 


on 








[M 


Ml 


rM 

•-m 


M 


m-i 








Here and in the following sentences 


the glosses 


for compounds and for 



o 

ERIC 
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reduplication are enclosed in parentheses, but otherwise all notation (inclu- 
ding that for brackets, M, m, and M ^ placed in the third line under 

the uncluttered Chinese morphemes and their literal English translations 
(glosses/; the free translation precedes the numbered sentences. The mor 
pheme fegl appears as the verb seal and as the classifier for letter in sen- 
tence (4a); it also appears in the compound for envelope (Sin^ (letter 

^ ealer ) in sentence (4b). The compound including gin^ does not occur with 
as classifier. 



Simple compounds are represented as (M M) when each member can 
also appear as a single major morpheme M, not in compound. The last 
bracket in sentence (3) provides an instance of this: ( play juggle ), where 
two different free major morphemes denote play . They are free because 



each M may appear out of compound, uan^ to play , Sua^ to juggle . 

The first or second member of the simple compound, or even both 
members, may be ^ instead of M (see above), as in the first bracket of 
sentence (4b) where ( letter sealer) means envelope — an instance of (M ^) . 



When the first member of a compound 



a uouna jvi- or 









bound -M, or when both members are bound, the simple compound is repre- 
sented as hyphenated (M-M) . Thus, the M- f©?^ c©mius@d is fecund 

while the M for cursive (c'au^)’ is m the oimplQ ( cosi- 



fused - cursive ) for careless in sefileag® f2)o 

Whether 'the simple compound is the fM M} ©5? fM-S 
relationship of the literal glosses t© what the e@i?ajp©iaad dea©?;Go en • . 



©IK . 
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may be either self-exlanatory or even redundant, on the one hand, or para- 
doxical in the sense that the literal glosses, especially when coordinate, pos 
a paradox which is resolved in the meaning of the compound as a whole. 

Thus, the reduplicated compound in sentence (1) may be reduced to a simple 
(M M) compound, (hor se tiger ) , meaning carefree, not particular , not fussy 
Other simrjtle compounds with coordinate members are more or less like this 
(east west ) meaning something (ti^^ li®); and (horizontal vertical) meaning 
any way (heg^ su^); and ( gome go) meaning checking account (lai^ uaT}^). 

So also, ( to beat to sort) meaning to clean up (ta^ lu^); this is an instance of 
(M M) which may be shortened to the second member which alone is less 
specific than the compound (lu to sort, to put in order) . Some coordinate 
compounds are self-explanatory in the -ense that the meaning of the whole 
may be inferred from the meaning of the members a ( to repai r to fort) for 
to fix up (siu li ); this instance of (M M) may be (Shortened to or expand- 
ed from — the first member of the compound (siu^ to repair, to mend) . 
Fxam.ples of the self-explanatory kinds of simple compounds have appeared 
in the preceding sentences: ( confused- cur si ve) meaning careles.s ; ( play 
jug^gj^) meaning play; ( letter seal ^r ) meaning eiivelope . In some of these 
self- explaiuilory compo\inds, the fir«t member is subordinate to the second. 
In some, the glosses of the two members overlap in meaning, and then the 
two members are again coordinate. Thus, redundantly, (cl othing - clothing) 
means clothing (i fu ); and ( body - body) means body (sen^ t’i^); and 
(friend -friend ) means frien d (pen^ iou ); and ( mad raving ) means insane 
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feg k'uag ); and ( look see ) means to see (k'an^ 5ien^). If the first mem- 
ber is modifier in respect to the second member, the compound may be 
either paradoxical or self-explanatory as ( move thing ) meaning animal (tu^"^' 
u^); as (ground diagram ) meaning map (ti^ t'u^). Compare ( offending per - 
criminal , and (interior person ) for wife (of speaker) , where the se- 
cond member is the same —■ (fan^ ren^) and (nei^ ren^) respectively. So 
also, for the next pair, ( fierce hand ) means murderer (Sioi)^ 5ou^) and 
(r^e hajid) means a good shot , an expert marksman (c*ian^ Sou^). For the 
last compound, reversal of order is possible: ( hand rif?^) for revolver (§ou^ 
Sia^ ) • In fact, this is the only compound cited so far in which the order of 
(M| ^ 2 ) can be changed to (M 2 M^), and still remain a compound. In (new 
smell ) for hews the second member is glossed for its value in spoken lan-“ 
guage, but is closer to the meaning of the whole compound if it is glossed for 
its Wenyen value ( new hear ) for news (iin^ uen^). If the speaker does not 
know Wenyen, he has to learn this compound without the crutch of analysis: 
so also, for the following compound whose members can be interchanged 
without change of meaning: either (hear see ) or ( see hear ) for observe, 
observation (perception by ears and eyes, or by eyes and ears, but not by 
nose) — (uen^ ^Aerfi) - (5ien^ uen^) — even though uen^ out of compound 
.now means to sniff , to smell . Finally, w© list compounds in which the second 
member is related to the first as its object, as ( liow blood) for to bleed (lieu^ 
iCie ); as (move - body ) for to start on a §en^); and, more para- 

doxically, as (use - merit ) for to study hard (ion^ kup ^) . 
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In the preceding two-member compounds, the gloss selected for each 
member was influenced by the meaning that the morpheme in question would 
mark when out of compound. If the meaning that the morpheme marks when 
in compound were selected, then instead of (_fier_c^ hand) for murderer, we 

8^®®® ( crime hand ) , instead of s-i^sll) for news, (new hear), 

uen^ is fairly active in compounds. 

Triple compounds are marginal in inventory, or seem slightly out of 
pattern when used in some neologisms, but suggest no uncertainty or ambi-- 
guity in analysis of the immediate constituent type. A triple compound is 
represented as (M M) (M) or as (M) (M M) when the members are free 
major morphemes, as in ( season wait ) (wind ) for the monsoons (?*i^ hou"^) 
ffeg^), and as in (yellow hot) ( sickness ) for yellow’ fever (huar)^ re^) (pig^); 
(^[ellow hot) does not occur in sequence except before ( sickness ) . It is pos- 
sible to distinguish between a simple compound preceded by the major mor- 
pheme, for yellow in sequence with minor morpheme, and a triple compound 
in which M for yellow is the first member: (M-m) (M M) represents ( yellow - 
of) (cover c^) for yellow rented car [huan^ te^ (pau^ c'e^]. The same 
three major morphemes in triple compound — (M) (MM) but without the 
minor morpheme te — appear in 'acquired Mandarin', perhaps under 
Shanghai influence, as ( yellow ) (c over car ), meaning ricksha .r(huan^) (pau^- 
S e )]. When one or more members of a triply compound are bound, this is 
represented by hyphens, as (M - M) (- M) for ( descend - fall ) ( squad ron) 
meaning paratrooper^ (tuei^)]; compare ( descend - fall ) ( umbrella ) . 
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meaning parachute luo"^) (san*^)]; and the sequence ( cold - hot ) which 

occurs only in sequence ( cold - hot ) (tabulation) meaning thermometer [(han^ 

2 3 

5u ) (piau )] . Perhaps ( not have - wire) ( electricity ) meaning radio is an 
extended caique from wireless [(u • 5ien^) (tien")]. It is not certain whether 



the following example represents a triple compound (M) (M M), or a phrase 
in which the first member, glossed to send out , precedes an (M M) com- 
pound, [ to send out ( spleen steam )], meaning, in either analysis, to lose 
one*s temper [(faM (p*i^ 5*i^)]. 

Sequences of paired compounds also occur (see below) . So also do 
paired reduplications — i.e. reduplicated compounds. Both are extensions 
of simple compounds and simple reduplication. Sequences of same mor- 
phemes in redupL.cation and their glosses are inclosed in parentheses, as 
are members of compounds, but the relation of the repeated members in 2i 
reduplication is shown by different notation than the relationship of different 
morphemes in a compound. For bound morphemes, the plus sign is used 
when the morpheme is repeated in reduplication (M + M); and, as already 
noted, the hyphen is used for connecting different bound morphemes in com- 
pound (M - M). Most common kinship terms are instances of (M + M), 
though they ^.re reduced to single M in letter writing — in epistolary style — 
as substitute for first person, the writer's kinship term in respect to the 
person written to; and as substitute for second person, you , the addressee's 
kinship term in reference to the writer. But other than in letter writing (and 
in Wenyen), the majority of kinship terms a,ppear as instances of (M M) 
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reduplication. Thus, in instances specifying younger siblings! (brother + 
brother ) for younger brother (ti^ ti° ) , and feister + sister ) for younger s ister 
(mei^ mei^). So also in instances specifying older ’siblings, ( sister + sister) 
for older sister (5ie cie ) , and ( brother + brother ) for olde r brother (ke^ 
ke°). In all instances but the last, these M's appear not only in (M + M) re- 
duplication but also in (M - M) compounds, as (younger brother - younger sis - 
i®x) younger siblings of either sex , collectively (ti^ mei^), and as 

_sister- yo unger sister ) meaning sisters irrespective of relative age , 
collectively (5ie^ mei'^). On the other hand, though there is an (M - M) com- 
pound for ( older brother - younger brother ) meaning brothe r s irr e spective of 
relative age , the first member of the compound is not selected from the (M + 
M) reduplication, as given above, though the second member is so selected — • 
(Siog^ ti^) rather than *(ke^ ti"^)o 

The relationship of the repeated members of a reduplication is indicated 
by the plus° and- minus sign when the repeated morpheme is free — that is, 
also occurs out of reduplication and out of compound, as well as in (M + M) 
reduplication. Thus, as a free morphencie, t*ai*^ too much occurs before 
modifiers, including quantifiers ( as too much red , too many ); but in redup- 
lication, represented as (M + M) , ( too much + too much ) means wife (t*ai^ 

9 0 

t*ai ). For morphemes which may enter into such optional reduplication, the 
difference in meaning between the single morpheme and tlie s^edupUcated mor- 
phemes, when there is a difference, is quite diverse more ©r less para- 
do^cical, as in the preceding example; also, having reference to command or 
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entreaty or commiseration, as in the sentences immediately following; and 
also marking other notions, as distributiveness or liveliness. If the repeated 
M's In sentence (5) were reduced to a single M, the sentence would declare 

You are hel£ing that as cited, with (M +M), sentence (5) is imperative. 

You give that man a hand. 



. 3 
ni“^ 


fpar)^ pag^} 


.4 

nei 


1, ® 
ke 


2 

ren 


you 


(help + help) 


that 


classifier 


man 


[Ml 

3L 


[(M + M)], 

D 


rm 


m 


Ml 

c 



In sentence (6), the M for thin is out of reduplication. 
He’s very thin . 

(6) t'a^ hen^ sou^ 

he very thin 

[M] [m m] 



In sentence (7), th i reduplication adds the notion of commiseration, as in the 
free translation. 



He’s rathe 


r thin (poor 


thing ) . 


(7) t’a^ 


(Sou^ ^ou^) 


te° 


he 


thin + thin 


-l2 


[M] 


[(M + M) 


-m] 



no* 



But in a parallel reduplication, [he +fat) -Ijr], reduplication adds the 
tion of something jolly or lively, as when we say of a baby He's fat (cute). 

In sentence (8), the reduplication adds the notion of entreaty. Try and eat thi 
.to^ ( to see if it's sour or sweet or bitter or poison, or whatever in con- 
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text ) . 

Try and eat this . 

(8) k'an® 

( eat + eat ) look 

[(M ±:m) g-] 

Compounds also appear in pairs, as do reduplications. An example of 
the former is represented by (M - M) - (M - M), as for example, ( older 
brothe^r - younger brother ) -( plder sister - younger sister ) meaning^ siblings 
(sio^ ti )'’(5ie mei ), Examples of the latter are found in sentence (1) 
( horse + horse ) -( Aiger -f tiger ) meaning in a slap - d ash manner ; and in sen-* 
tence (3) ( high + high ) -( spirit + spirit ) meaning happy . The reduplicated 
compound marks a more liyely meaning (e.g. slap - dash ) that the simple 
compound (e.g. carefree ) , when there is a difference. The difference be- 
tween sentence (3), with a reduplicated compound, and sentence (9), with 
simple compound, is a matter of what marks liyeliness — marked by redup- 
licated compound in (3), and by a minor morpheme for very in (9), 

I*m very happy . 



(9) uo^ 


3 

hen 


(kau^ 




I 

mmm 


very 


(high- 


-spirit) 


[Mia 


[m- 


(M - 


M)]^ 



In all instances of compounds and reduplication, so far, the constituent 
members are represented by major morphemes (M's) . But two-member 
compounds 9 or conripound^^iike sequences of two members, also occur in 
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which one member is a minor morpheme, m or The most productive of 
the latter are formative compounds. 

In formative compounds the major morpheme is dependent. Thus, 



(child - formative ) means child (xai“ c-t^); and (g randchild - formative ) means 
g randchild (suen^ ); and (off spring ^ formative ) means son (er^ ci^ ) , All 



of these represent instances of (M - 



) . But compare the last instance for 

3 0 ., 



son w.th instances of (M - , as (female offspring ) for daughter (nu^ er^ ) , 

on the one hand, and with the members of this compound reversed, v^rhen the 

g loss for offspring is specialised to male offspring ,, as in ( male offspring " 

7 3 

female ) meaning offspring of either sex or both sex es (er^ nii ). 

When the first M in a compound is free, the formative represented above 
S'® often functions as — rather than as a formative; indeed, though the same 
mcrplip-ne, it is glossed — after free M — as potential and represented as 

^ in a simple (M compound. Thus, ( melon potential ) means melon seed 

13 2 

(kua ci ) in contrast to melon (kua^); and ( fish potential. ) means roe (u 

3 2 1 3 

ci ), beside fish (ii ); and (gun potential) means bullet (c*ian^ c*). beside 



I 



(^ iaig ); and ( female potential ) means woman (nS c+*^) . Similarly, when 
the members of the compound for woman "• are reversed, the new compound, 
^m " ^ » means offspring of either sex or 



..3. 

sexes (c4 nu ) . 



Localizer compounds in which the localizer has full tone are less com- 
mon t.ian formative compounds. The localizer, as second member of a com** 



pound, is represented by in (M ^) 5 ( heaven below ) means the world s 



^ m 



I 4 



(t ien iia ); (c ity inside ) means downtown (£’*en U"^); (ho rse abov e) means 
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immediately , at once (ma^ sail‘d). But « for localizer may also follov/ M out 

of compound, as in city in, reversing the prepositional order of English, in 

the city , and represented not as ^^[(M - ^)] but as [M - — [c*eg^ li^ ] » 

(Compare the same morphemes as a compound for downtown , above. ) Further-- 

more, morphemes here represented as ^ alternate as when they are 

M m ^ 

glossed somewhat differently. As sia^ is glossed down but as ^ sia^ is 

glossed descend ; so also, li is inside , within aii but is lining of garment 

0 4 

in the compound )l so also sat) is on, above as but is 

to go up , to ascend 2is ^ . 

Within the bracket, the major morphemes, M and and the minor 

morphemes, m and HLt which are free and not in parentheses, function as 

M 

parts of speech. So also, the parentheses for compound or reduplication or 
reduplicated compound count as one part of speech, though the members of 
the compound or the repeated morphemes in reduplication are not so classi- 
fied. It is only the part of speech of the whole parentheses that is relevant to 
the syntax of the phrase or of the sentence, though it would be possible to 
assign class membership to each member of the compound. Thus, the redup- 
lication (t oo much - - too mudi ) means wife and counts as noun, irrespective of 
til® form class of the repeated morpheme enclosed in parentheses (t’ai 
The compounds and reduplications represented by parentheses as wholes 
as well as major morphemes, M or out of parentheses ~ both distinguish 
four parts of speech; verb, noun, adjective, and adverb; and a fifth part of 
speech, pronoun, is distinguished by M alone (out of parentheses, since pro- 
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nouns are neither reduplicated nor enter into compounds). 

Within a phrase bracket, the minor morphemes, m or give specifi- 

JM 

cation or support to the major parts of speeclijj or otherwise complement the 
major morphemes, M or and some minor morphemes also distinguish 
what mighi, be called minor parts of speech: demonstrative, numeral, clas- 
sifier, as well as localizer, directional and other complements. 

In self-sufficient sentences, each phrase bracket includes at least one 
major morpheme, M or Stated conversely, no brackets have been cited, 
so far, with one or two minor morphemes, m or to the exclusion of major 
morphemes. Such phrase brackets do occasionally occur, however, but only 
in sentences which follow or are followed by a self-sufficient sentence in a 
longer discourse. This is one dimension of macro- syntax. When not self- 
sufficient, either the stimulus sentence or the response sentence may be 
linguistically dependent on a self-sufficient sentence in the longer discourse; 
or ethnolinguistically dependent on the context, as by pointing at something 
prior to beginning a sentence with a demonstrative functioning as pronoun 
(2!e*^. . « This . , , in sentence (14), below) . Such dependence, whether linguis- 
tic or ethnolinguistic, is a clear indication of macro- syntax. A phrase 
bracket which includes minor morphemes (m ^^ ^) exclusively is sure to 
be in macro- syntax, while in the narrower sentence syntax, all phrase 
brackets include a major morpheme (M or ^). 

In sentences and fthi s classifier letter ] represents a se- 

quence of two minor parts of speech (demonstrative and classifier) followed 




i 
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by a major part of speech (noun), in a phrase bracket M] — which 

functions as a noun phrase. The same two minor parts of speech are followed 
by a noun compound in parentheses in the self-sufficient sentence, 

This car d is £re^. 



(10) ce'* 




1 » 3 
k a 


s j • 4 

p . len 


hau^ 


. » 4 

k an 


this 


classifier 


(card 


sheet) 


(good 


look) 




M 


(M 


M)] 

m ^a 




M)], 

b 



But though the same two minor parts of speech may be the only members of 

a bracket phrase, ^]» without M for noun, the bracket as whole is a noun 

M M 

phrase, as in 
This one is pretty . 



X 4 
ce 


X 1 

cai] 


hau^ 


1 » 4 

k an 


this 


classifier 


(good 


look) 


rm 

■•M 


M^a 


[(M 


M)]b 



The difference between sentences (10) and (H) is that (10) is a self-sufficient 
sentence, while (11) is found only in macro-syntax, as defined above. The 
ethnolinguistic dimension of macro- syntax is well illustrated in the next 
sentence: standing before a collection of photographs or paintings, one 
might say. 

Every one of them is pretty . 

( 12 ) 







, 1 
tou 


3 

hau 


1 » 4 
k an 


(classifier ! 


classifier ) 


all 


(good 


look) 


[(m ^ 

M 




[m 


(M 


M)], 

b 
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Heard as an isolated sentence from the playback of a tape recording, one 
could not say from sentence (12) just what was pretty — it might be, on the 
one hand, a postca^rd, or photograph, or painting, or even sheets of blank 
paper, and it would be in this domain that one would first guess what refer- 
ence was intendedj on the other hand, since the particular classifier in sen- 
tence (l2^ is also relevant to other objects with extended surfaces, including 
beds, desks or tables, stools, rugs, and linen sheets, one's first guess would 
have been wrong if the actual reference was to a collection of stools or rugs, 
for example. The compound classifier here is, generically, a distributive 
collection of objects with extended surfaces . But the same reduplicated 
classifier also occurs before a compound noun in the first bracket of the 
self-sufficient sentence. 

All the cards are pretty. 



(13) 




k’a'* 


» • 4 

p len 


tou^ 


hau^ 


k an 


(classifier 


classifier ) 


(card 


sheet) 


all 


(good 


look) 


± 




(M 


M)] 

m'^a 


[m 


(M 


M)]^ 



All brackets which include a classifier function as noun phrase. Sentences 



(ll) and (12) are in macro- syntax, while (10) and (l3) are self-sufficient. In 
self- sufjiicient sentences, the order is either demonstrative or numeral be- 
fore unreduplicated classifier before noun ( 10 ) ; or else reduplicated classi " 
before noun (13); see also sentence (61) for adjective phrase between 
classifier and noun. In macro- syntax, the order is either demonstrative or 



numeral before classifier , as in sentence (11); or else repeated classifier 
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(reduplication) as in sentence (12), without following noun in either order, 
but with the bracket, nevertheless, functioning as noun phrase » Reduplica- 
tion of the classifier is ir compatible with prior demonstrative or numeral. 
Such restrictions are more closely associated with the classifier than with 
the noun (v/hich may occur with or without classifier) or the demonstrative 
or numeral (which may also occur with or without classifier). But unredup- 
licated classifiers are flanked by following noun and preceding demonstrative 
or numeral. 

Many classifiers may transformed from a classifier (a minor part 

of speech) to a major part of speech (as noun or verb ) . Thus, Sar)^ is 

transformed from classifier (^t above) to verb (“), as in (2an^ k*aiM to 

JVi m — 

open up . A better example is fe^^ which may be transformed from a verb, 
to seal (^) f t© classifier (^)t relevant to the domain of letters and small 
packages, as in sentence (4a). So similarly, pen"^ may be transformed from 
a noun (glossed in various ways — as cost , fiscal capital , and when in com- 
pounds, as same , origin , root , booklet ) to classifier relevant to a narrow 
domain “ to bound sheets of paper or cloth, as notebooks, magazines, pat- 
tern books, and ordinary books. 

Beside the hundred or so classifiers, which are transformable in this 
way (from ^ to ^)f there are relatively few classifiers which are never so 
transformed. Hence, the latter are represented as non-transformable minor 
morpheme (m) rather than transformable ^ . Thus, is relevant to the 
medium sized objects, animals, or a member of a pair; k^e^ to the domain 
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of plants and espacially trees; tuo*^ to the domain of flowers; k'uai is relevant 

to lumps or pieces, and may also be glossed dollar , but only in macro-syntax; 
4 . 

ke is relevant to the widest domain of all and is also substitutable as a general 
classifier in place of the more domain^restricted transformable classifiers. 
Still the domains to which ke^ are relevant are not entirely unrestricted •«» 
ke"^ does not occur before nouns for sheets of paper, nor before nouns for 
long strips as river or thread, flowers and trees, etc, , when preceded by 
numerals higher than one , though it may occur before such nouns when pre- 
ceded by the numeral one or a demonstrative. 

Sentence (14)^ following, shows a non-transformable minor morpheme 
anaffixal relationship to one of the major morphemes: M-m is glossed as 
and means my before the noun compound (heaven below) , translated 
freely as world in a sentence excerpted from a longer discourse. 



This 

(14) 


is my 

_ 4 

5e 


world, 

4 

• 

S-l 


3 

uo 


0 

te 


t* ien^ 


4 

§ia 




this 


is 


i 


of 


(heaven 


below) 




'•rm-'a 


[M]^ 


[m - 


m (M 





Minor morphemes affixed to major morphemes — as the instance repre- 
sented by M-m in sentence (14) — are very low in inventory count, but occur 
over and over again in texts. Some few are suffixed (-m); still fewer are 
prefixed (m-); and an occasional one or two are infixed (-m-). An example 
of the latter appears in echo reduplication which is a variant of the simpler 
kinds of reduplication and compounds noted above. Thus, the simple (M-M) 
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compound •*•“ (hu^ t’u ) glossed, as the members would be glossed out of 
compound, as ( paste -- smear ) for the adjectival verb, b lurred or muddled -- 
is transformed from verb part of speech to adverb part of speech, meaning 
in a muddled manner , good and muddled , blurrish , when the simple compound 
is wholly reduplicated (both members reprsated), and also when partly redup- 
licated (first member repeated, with intervening Infiac); (hu^ hu®) - (tu^ tu^); 
and also (with infix -U^-), (hu^ li® hu" tu^)j.the latter is an instance of echo 
reduplication. 

For an example of prefix for not alternating with verb for have no , 
compare (15), (16), and (17): 

I donh have (as response sentence). 

/ie;\ 3 .2 . 3(«'0) 

(15) uo mei lou 

_I not have 

[M]^ - MJb 

.1 have money . 

(16) uo^ mei^ iou^ ien^ 

— have money 

I haven *t any money . 

(17) uo^ mei^ tf’ien^ 

hctve no money 

But it is possible to reorder sentence (16) from a b c to c a b 






as in 
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English, from I don*t have money to Money I don’t have . Such reordering is 

2 

not possible in sentence (17) where mei functions as a verb before a noun 

included in the same bracket because the noun in this kind of sequence is not 

independently permutable. (One can *t end a sentence with mei . ) The same 

noun is included in a separate bracket when it is reorderable •— from last 

7 3 

bracket, as in sentence (16) to first braxket in sentence (16); [?ien^] [uo ] 

7 3 

[mei iou ] Money I don*t have . 

In the western tradition of alphabetic writing, space serves as a word 
marker. If a word contains unusually many morphemes — say a prefix (m- ) , 
stem (M), and suffix (-m) — the whole m-M-m sequence is still set off by 
space as, for example, un - usual - ly in this sentence. In the Chinese tra4i- 
tion of character writing, space — or whatever Chinese use as the equivalent 
of space — serves to separate successive syllables, and that amounts to 
successive morphemes in almost every case. This tradition of writing per- 
mits recognition of characters (by the device of keeping each character to- 
gether). Thus, separate morphemes by separate character tradition may 
be continued in modern alphabetic writing; indeed, it is reflected in the way 

the 'acquired Mandarin* is written here — by keeping morphemes apart by 

1 9 Z 0 

space, as in fei^ te^ un - usual - lY.- It is we who add the parentheses 

(to group together morphemes in reduplication of in compound). Just as we 
add the brackets (for phra,ses which include one or more morphemes), and just 
as we insert -,+, + as notation within parentheses for formulae (but not for the 
Chinese part). These are merely notational additions which still permit space 
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to serve as a morpheme boundary marker, irrespective of whether a given 
morpheme is always a major morpheme (M); or always a minor morpheme 
(m); or sometimes one and sometimes the other ^ ^)- 

There is no lack of clear-cut distinction between an M which functions 
as a major part of speech say the pronoun foiM (uo"^) and an m which 
is always a minor morpheme of the kind that never alternates as a major 
morpheme. Every instance of the latter (m) is an affix either a prefix or 
a suffix (or an infix, but there are relatively few infixes) . But though an 
affix, the morpheme represented by m is still separated from the other mor- 
phemes by space. Thus, in the case of the M-m sequence for I- pluralizer , 
meaning we , there is a space between the major morpheme (uo^) and the suf- 
fixed minor morpheme (men^): uo'^ men^ we . 

This is consistent and useful. It is consistent because space separates 
morphemes, not words. It is useful for languages whose affixed (m‘s) may 
have either a narrow dependence range in which the dependence extends only 
to adjacent major morphemes (hence as part of one 'word* in languages like 
English with a word level) , on the one hand; or on the other hand, whose 
affixes (m's) may just as characteristically have a wide dependence range 
in which the dependence extends to a whole phrase, clause, or sentence. 

The wide dependence range is not altogether unknown in languages with 
word level like English, but it is not characteristic of them. It is found in 
such expressions a,s "the King of England's hat" or, with an even wider depen- 
dence range for the same suffix (‘s), in the sentence that H. L. Mencken ( 1936 ) 



Anthropological Linguistics, Vol. 6, No. 3 



cites as a curiosity! "That umbrella io the young lady I go with’s," As 
Y,. R. Chao (I960) has shown, analogues of such wide dependence range for 
suffixes are not at all curious but instead just as characteristic of Chinese 
structure as the narrow dependence range exempLaied above by [l» pluralizer 1 
meaning we. A similar example is [l-of ] meaning mine in a phrase without 
following noun, or meaning my whatever the noun is, if a noun is included in 

*3 0 

the phrase (uo"^ te , before noun). Compare now two suffixe^s in narrow 
range dependence: [l“ pluralizer ~of ] meaning ours in a phrase without follow- 
ing noun, or else our (uo^ men^ te*^, before noun). 

The list of commonly heard affixes is a short one when it includes 
affixes whose dependence range is generally within a phrase span — i.e, 
within a bracket. Two affixes in this first list appear again in an even shorter 
' second list which includes a few other affixes whose dependence range is 
v/ider-extending to whole clause or even to whole sentence. 

The pluralizing suffix appears not only after pronoun, as [l- plurallzer ] 

3 0 

meaning we (uo^ men ), but also after some nouns in a sociolinguistic set 
v/hose generic is man as a human being, including status and role terms, as 
professor , student , worker , comrade in M-m sequence and in compound 
sequence, as (M - M) - m. Since nouns are non-committal as to number, 
the pluralizer suffix at least adds explicitness of number (in a collective 
sense). Thus, [ren^] person or persons but [ren^ men ] womenfolk^people. 

o 

cj 

Quantification by prior numeral is incompatible with pluralizing by suffix: 
(three classifie r persorx ] meaning three people (san^ ke ren^), but never 
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^^[ three classifie r person p^ralizer]. On the other hand, pronouns do mark 
singular person, and hence are really changed in number by the pluralizer 
suffix^ When the pronoun referent is to human beings, number is specified; 
number is non-committal, however, in pronominal reference to plants, to 
inanimate objects, or to some animals. Thus, means it or they in refer- 
ence to non- domesticated animals, plants, and things; but in reference to hu- 
mans and domesticated animals, its meaning is restricted to third person 
singular of either sex, as he •»' she . And the referent of the hl-m sequence 
is restricted to human (or domesticated animal) third person plural, as they 
(ta^ men^) . 

The pluralizer suffix has least to do with syntax, since its dependence 
lange is restricted to the pronoun or noun preceding it, in contrast to the 
subordinating affix;, te^, which appears in many environments. Thus, it 
appears in [M - m M] sequence where the M - m is modifier to the following 
M, either ' as possessive modifier or as adjective modifier. Thus, with pro- 
noun, [he ^of book ] means his book books (t'a^ te^ Su^); so also ivith noun, 
[ hor se-of foot ] means horse’s foot - feet (ma^ te^ Siau^) . But as adjective 
modifier before noun, [ gold *of money 1 means gold coin (^?in^ te^ Jf*ien^) in 
contrast to the noun compound where gold enters the compound meaning 
money ; hence (money money ) rather than [( gold money )] which means 
wealth , treasure , capital in the form of money (^in^ ^ien^). The same suf- 
fix appears in [(M 4; M) - m] sequence where the suffixed reduplication is 
adverbial modifier of the following verb phrase, which happens to be imme- 
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diately following in sentence (18), 
He runs quickly . 



00 


1 » *4 

k uai 


1 » . 0 
k uai'^ 


teO 


, 3 

p au 


he 


(quick 


+ quick) 


-ly run 




[{M 


+ M) 


■“lb 


[M], 



Between the 'adverbial modifier' and the 'following verb phrase', there is an 
intervening bracket in sentence ( 19 ) which encloses a localizing phrase, 

He running quickly on the street . 

(19) ta^ k’uai*^ k*uai® te^ cai"^ ma^ lu^ 

he ( quick ± quick ) - ^ ( Macadam road ) top run 

[M]^ [(M ± M) -m]^ (M M) [M]^ 

The brackets may be reordered from a b c d as given in sentence (19 y to 
a c b d, without change of message. The same morphemes that appeared 
in sentence (18) appear again, but in a different order in sentence (20) which 
has two messages, depending on whether t*a^ pau te is taken as topic be- 
fore the comment k*uai*^; or whether ta^ is taken as topic, and the rest as 
comment: 

His running is fast (i.e. As for him , running is fast ); 

He can run fast . 

(20) t*a^ p*au^ te^ k*uai^ 

In the first message He is fast , with modifier m manner of running before fast . 
In the second message of sentence (20), the suffix te^ means can , in the 
sense of potential ability to perform (as the preceding verb specifies) but in 
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the manner specified by a following complement ( fast ) . The subject is 
omitted in sentence (21), which begins with the object: 

This kind can be eaten ( is edible ) . 



(21) Se* 


ke° 


C’i* 


teO 


this 


' classifier 


eat 


can 


[OQ 


m] 

•■a 


[m - 





Here bracket^ is a verb phrase comment which does not include an embedded 
actor; while bracketg^ is a noun phrase topic functioning as object rather than 
as subject, which as already mentioned is not specified; hance the object 
phrase precedes the verb phrase. This is in one respect parallel to sentence 
(22) which also shows noun in topic before verb in comment, but with verb 
followed by final particle suffix "le^, and with two possible messages, de'- 
pending on whether the chicken in the topic is taken as prepared food, in 
which case it functions as the object of the verb; or as a living organism, in 
which case it functions as subject; 

for chicken (pause ) , not eating any more; or 
The chickens are not eating any more , 



(22) 


4 

pu^ 


w * • 1 

c + 




chicken 


not 


eat 


any more 




[m - 


Mlb 


m 



As a final particle suffix, -le^ is glossed any more in the sense of that^s the 
situation. In the first message, the topic functions as the object of the fol- 
lowing verb phrase comment, with actor not specified, in the order of object- 
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verb which is understood as such only because there is a pause between ob- 
ject as topic and verb as comment; if uo^ for actor Jwere inserted, the expanded 

fjentence (22) would unambiguously reflect the object- subject- verb order, 

eati^, analogous to money I have . In the second message, 
the topic functions as the subject of the verb phrase comment. In both mes- 
sages the dependence range of the -le° as final particle suffix extends over both 
topic and comment, and hence over the entire sentence. There are some 
other final particle suffixes which appear in sentence final or after topic, 
or after coordinate comment brackets, with the same wide dependence range — 

ipterrogaiLve, a^ exclamation , pa® mild imperative , tentative or sup - 
positional . and a few others. 

0 

So far, -le has been shown out of bracket, as a final particle; but it 
also appears within bracket when it is glossed as perfective . So far, also, 

-te has been shown within bracket when it is glossed as can, potential abil - 
_i^ in some sequences, and as subordinator for modifiers in sequences in 
which modified follows; but -te® also appears out of bracket, as a final par- 
ticle suffix, when it is glossed as that*s it. as in 
JThe guest just apt to come (that*s the situation ) , 



:k’e4 


0 

ren 


huei^ 


lai 


^ 0 
te 


(guest person) 


can 


come 


that*s it 
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M)] 


rm 

•■M 


Ml 
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m 



There is no ambiguity in glossing the suffix -le® as perfective when it occurs 
within bracket while glossing it as oh look , obviously , as you can see , don*t 
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you see , whan it occurs as a final particle suffix, out of bracket; both occur in, 
He has eaten noodles ( as you can see ) . 



(24) t’a* 


c i 


le° 


. 4 

mien 


le° 


he 


eat 


perfective 


noodles 




[M] 

a 


[M 






m 



The brackets in sentence (24) may be reordered from a b c to c a b for 
Noodles he has eaten , but the out of bracket m representing the final le^ does 
not appear in this reordering; in fact, final le^ as particle out of bracket 
never follows suffix le^ in juxtaposition or, conversely stated, suffix la^ 
never precedes out of bracket final le^. There is no ambiguity in the mes- 
sage of sentence (24), or in any sentence including transitive verb with object 
specified or intransitive verb with certain kinds of complements specified; in 

the sequence M - m at the end of a sentence, with M representing verb and m 

0 0 

representing le , the le is perfective. But in sentences including transitive 
verb without complement, in the same M - m sequence at end of sentence, the 
m may represent either le^ as perfective within bracket or the leP as final 
particle suffix, out of bracket, Compare the two messages. 

The guest has come ; or ( Oh look ), a guest is coming . 

(25) 
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These two suffixes, te and le , belong in both the first list of affixes 
(with dependence range within the phrase enclosed by brackets),, and in the 
second list of affixes (out of bracket, with dependence range extending over 
the whole sentence) . Other affixes in the second list appear exclusively out 
of bracket (see above). Other affixes in the first list appear exclusively 
within brackets, as the suffix 5e which marks durative , progressive , and the 
suffix man^ which marks plural of pronouns and human nouns, as shown 
above. Prefixes (minor morphemes) in the first list include hen^ very , and 
pu not , when in self-sufficient senicences, and a few others which occur in 
such sequences as m-M, m-m-M, and m-m-m-M, where m represents a 
prefix, while M almost always represents a verb. But hen^ very appears as 
a major morpheme in [hau^ te'^ hen"^] good to an extreme . Compare differ- 
ent comments after the topic [t'a^] Im, as [not corne l (pu® lai“); and as 
[also come ] (ie lai ), and as [future again come ] (cai^ lai^), on the one 
hand; and on the other hand, the same three prefixes in one comment: 

He won*t ever come again (that*s t^^e situation) . 



(26) t’a‘ 


.4 

cai 


. 0 

le 


0 

pu 


1 

lai 


le“ 


he 


future again 


also 


not 


come 


that *8 ^ 




[m- 


m- 


m- M]^ 


m 



The first m- in bracket b, representing eai^ future again would not 
appear in a sentence like this without final particle suffix, out of bracket. 
And so in general, when not in dependent clause, ^ai^ co-occurs with final 
particle suffix — as le® in sentence (26), and as pa® in (27); 



o 

ERIC 

hiaifiiifftaiTi-Taaij 
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Let*s chat a little more . 



(27) cai'^ 


4.» 2 

t an 


t an 


0 

pa 


future again 


(chat 


chat) 


advisatlve 


fm- 


(M + 


M)] 


m 



The following sentences touch on an interesting distinction between 



'think verbs' (hope , expect , believe ) , which may have clauses — precedable 

by pause «- as objects, and 'link verbs' the objects of which have pivotal 
function. 



-i he'll come again . 

(28) uo^ pu^ liag^ t'a^ 
not think he 

[M]^ '"[m- M]^ [M] 

Bracket a represents the sentence topic, bracket b the sentence comment. 
But within bracket b are brackets x, a transitive verb phrase relating the 
sentence topic to the object clause, brackets y and z. Within this clause, 
bracket y represents an embedded clause topic, and bracket z an embedded 



.4 

cai 

future again 
[m- 



la 



.2 



come 



M] 



b 



clause comments 

Compare sentence (28) with a 'think verb* and (28^) with a 'link verb* 
I don't want him to come again . 

lai^ 
eome 



(28-) uo^ 


1 

pu 


4 

lau 


,, 1 
t a 


.4 

cai 




not 


want 


he 


future again 






■ 


[M] 

y 


[m- 



M] 



zj, 



Here ta^ functions both as object of the *Unk verb* want and as subject of 
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come ; hence, brackets y and z no longer form a clause as in (28). 

A pivotal construction after a link verb also appears in the following 
sentence of command^ which is structurally a sentence comment without 
preceding sentence topic; 

Ask him to come again . 

(29) ^*iii^ t*a^ cai^ l,„i^ 

invite he future again come 

[[M]^ [M]y [m 

Bracket a represents the sentence comment; within bracket a is bracket x, 
a transitive verb whose object is represented by bracket y, which also serves 
as subject of z. 

He did right in beating that dog . 



(30) t'a^ 


ta^ 


•4 

nei^ 




kou^ 


^.4 

Si 


• j 


, .1 ..0 

kai te 


he 


beat 


that 


classifier 


dog 


11 


ought 


ought that 


l!“ix 


[M] 

y 


fJSi 

M 


m 




f[M] 


[(M- 


M) m]J 








-*a 






b 


Sentence topic. 


as a whole, functions 


as a clause 


subject. 


It includes 



embedded clause topic, x, and embedded clause comment, y and z. Though 
embedded in this sentences , the clause is not a dependent one because it is 
possible to say [he ] [beat ] classifier dog ] independently. Sentence 

comment b , however, appears s cited only after a clause* included sentence 
topic — * that is, after a sentence topic which marks action, as dog beatin g in 
sentence (30). If^ instead, the sentence topic included a noun or noun phrase 
rather than a clause with verb, comment b would be altered from the selec* 
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tion eitedf[ (ought ought ) the following: tuei"* te" (correct that ], 

with the translation altered from an ethical judgement, He^ (or does) right 

m be^j; th^^ _ the dog deserved a beating — to a statement of approval, 
in the sense that the actor ^ (or U ) judged to ^ quite correct . 

At the sentence level, variability in iength depends on whether a sen- 
tence includes topic and comment, or comment alone; and on whether a clause 
IS mcluded in either topic, or comment, or both. At a lower than sentence 
level, phrase brackets are said to be short when they enclose one or two ma- 
jor morphemes (or parentheses for compound or reduplication) - phrase 
brackets are said to be long when they enclose more than two major morphemes. 

Where a sentence is of the interrogative kind in which the question is 
marked by paired alternate choice, the bracket span or phrase is generally 
short. The subscript letters for the brackets specifying alternate choice in 

the following sentences are b fbr the first choice, and b-prime (b') for t!ie 
second choice. 

Are you going ? 



(31) ni^ 


e’a^ 


0 

pu 




jmu 


££ 


not 


£0 


[M]^ 


[Ml 

••mJb 


[m 


Ml 

m^b' 



The minimum answer to this question is either bracket b (for the affirmative), 

or bracket b' (for the negative) - either £o, or not£o. Compare alternate 
choices without negative, as in 

Does he want tea or coffee ? 
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(32) t’a^ 


. 4 
lau 


c a 


hai^ 


^.0 

S-t 


. 4 
lau 


k* a^fei 


he 


want 


tea 


Still 


be 


want 


coffee 




rM 

*-m 


■M], 

b 


-[m- 


M]- 


rM 





The minimum answer for this question is either bracket b for the first choice, 
OT bracket h' for the second choice — either want tea or want coffee . Where 
the object of the verb is included in the alternate choice brackets, a bilaterally 
dependent phrase bracket meaning or , -[hai^' or of t^n inter- 

venes between the* brackets or (rarely). precedes the alternate choice brackets. 
The alternate choice brackets may also be juxtaposed and the bilaterally de- 
pendent phrase hai s* may be omitted — without changing the message. So 



also in. 

Do you like to eat rice or eat s^ioodles ? 



(33) ni^ 


§i3 


huan^ 


w 9 • 1 

C -k 


fan'* 


hai^ 


.0 

S4r 


c •* 


4 

mien 


you 


(jojr 


-delight) 


eat 


rice 


still 


be 


eat 


noodles 


[M]a 


[(M 


- 


[M 


M] 


-[m 


- M]- 


[M 


M] . 



The most likely answer to this question is either brackets b and c for the 
first choice, or brackets b and c' for the second choice — either like eat 
rice or like eat noodles. 



Sentence (31) is briefer than the following sentence (34), in which a 
directional phrase is in a dependence relationship to each of the preceding 
alternate choice brackets, and is therefore enclosed in a separate bracket, 
discontinuous with the first choice bracket b. This directional phrase cannot 
be included exclusively within the second choice bracket b^. because its 
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dependence range extends equally to both brackets, as the minimum answers 
show. 

Is he jumping down ? 



(34) t’a^ 


, » . 4 

t lau 


1 

PU 


4-* ‘ ^ 

t lau 


. 4 
sia 


C U 


he 


jump 


not 


jump 


down 


away 




[M]“b 


[m- 




rm 


m-* 

M^c 



The minimum answer to this question is either brackets b and c (for the 
affirmative), or brackets and c (for the negative) — either jump down 
av/ay , or not jump dow n away . 

Most phrase brackets in the alternate choice sentences, (32), (33 ), (34), 
include two morphemes. When a sentence is of the kind in which the question 
is marked by an interrogative demonstrative, the bracket span or phrase is 
often shorter. 



Whom did he beat ? 




( 35 5 t * a ^ 


3 

. ta 


7 

V/ • 

sei 


l:e 


beat 


whom 


[Mia 


[M], 


[M] 

c 



In a possible answer to this question, brackets a and b remain unchanged 

3 

while, for bracket c, a pronoun (uo me ) may replace the interrogative 
demonstrative — he beat me . 

Who beat him ? 



.2 

sei 




t’ 


who 


beat 


Imn 


[M], 


[Mlb 


■ ■ 
o 
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In a possible answer to this question, brackets b and c remain unchanged 
while, for bracket a, a pronoun (uo"^^) may replace the interrogative demon- 
strative — I beat him . And so also for other interrogative demonstratives 
(as na^ li v/here , sem ^ me^ what, na^ which) ; the order of brackets in 
the question sentence remains parallel to that of the answer sentence which 
differs from the question only in substituting a noun or pronoun for the interro- 
gative demonstrative. Like other demonstratives, the interrogative demonstrative 
also appears as a prior member before classifier relevant to a following noun 
(see above). 

However, where a sentence is of the interrogative kind in which the 
question is marked not by paired alternative choice, and not by interrogative 
demonstrative, the bracket span or phrase either is, or may be, long that 
is, longer than one or two major morphemes or their compounded or redup- 
licated equivalents. The same four morphemes appear below In sentence (37), 
bracket b (prior to the interrogative final particle suffix, ma out of bracket), 
sentence (38), bracket b, (where the interrogative is '.narked by giving 
an alternative choice in bracket c — » not, however, a coordinately paired 
choice). 

Has he jumped down? 



(37) t'a^ 


■ 1 . _ 4 
t'lau 


4 

y • ^ 

sia 




le° 


0 

ma 


he 


jump 


down 


away 


perfective 


interrogative 


[Mj 

a 


[M 


m 

M 


m 

‘m 


m]^ 


m 
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(38) t’a^ 


t* iau^ 


4 

SIS. 




le® 


.0 

mei 


. 0 
’OU 


he 


.ium£ 


down 


away 


perfective 


not 


have 


■ [M] 

a 


[M 


m 

M 


m 

M 


ml 

^b 


r- 

'M 


M] 

■*c 



Though the question is the same, whether expressed as sentence (37) or (38), 
the difference between these two sentences is appreciable in terms of the dif- 
ferent minimum affirmative-negative answer or response that would follow. 
For sentence (37), it would be affirmative it*s that (s**^ te®), or negative 

(naei iou ). For sentence (38), the minimum answer would be either 
bracket b affirmatively, or bracket c negatively — either jump down away 
perfective , or has not . 

Many sentences show twc successive brackets enclosing noun and verb 
phrases — the topic (as subject) and the comment (as predicate) . But the 
topic cannot always be equated to subject in function, nor the comment to the 
predicate in function. When the topic functions as subject and the comment 

as predicate, they do so as one special function out of the various functions 
of topic and comment. 

At a lower level, bracket phrases are often short, especially so in 
alternate choice paired questions and their minimum answers, and in imper- 
ative sentences. Wherx the brackets are long, the order of morphemes within 
the bracket is fixed in reference to a nucleus verb or noun (which may be a 
nucleus major morpheme, M, or nucleus parentheses enclosing reduplicated 
of compounded M*s). Though both short and long brackets, as wholes, are 
often reorderable or permutable within a given topic or comment, the order 



o 
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of morphemes within any given bracket is no more reorderable than are af- 
fixes in languages with a *word* level. Indeed, the bracket as a whole is 
analogous to a ’word* in this sense. But in another sense, it is analogous to 
a phrase. The nucleus noun or verb in the bracket may be flanked not only 
^-ffix-like minor morphemes (m, in our notation), but also by morphemes 
which function as minor morphemes, when flanking a nucleus major morpheme 
A hundred or so classifiers may appear in the same bracket before noun nucleus 
just as a smaller inventory of localizers may follow the noun nucleus, again in 
the same bracket, as has already been shown. But the very same morpheme 
that functions as localizer (^) after noun nucleus, or cls classifier (^) before 

noun nucleus, may also function as the noun or verb nucleus (^ ) in another 

m 

bracket, as is most clearly shown in sentence (4a), where fe^^, as verb nu° 
cleus, means to seal (^) in one bracket, while fe^® means classifier relevant 
to envelope or small package (~) in another bracket. 

In general, the relationship of prior morphemes to the nucleus of the 
bracket is predominantly that of modifier^modified — both when the modifier 
(or modifier sequence) is adverbial before a verb nucleus, and when it is ad- 
jectival before a noun nucleus in the bracket. Examples of the latter are big 
horse [ta ma ], small house [^ia^ (£a^ ct^)j, and good man fhau^ ren*"]. 

The adjective noun sequence is in effect an expanded noun phrase (as in sen- 
tence 6l ) . 

There is no predominant, general relationship of following morphemes 
to the preceding noun or verb nucleus, though one of the most clear-cut rela- 
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tionships is that of localizer aft(^r the noun (as house in [Cia^ li^] for in the 
house ) , and of directi©nal after the verb — but many other kinds of mor- 
phemes may also follow in the same bracket, including suffixes (see above). 
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The term ‘verb complement* is used for all morphemes of the ~ type 

M 

that follow the phrase nucleus in the same bracket. When a single verb 
complement or a pair of such complements specifies direction of an active 
verb, it is termed a 'directional'. The same morpheme is glossed differ- 
ently when it functions as a directional in one bracket [* * *^ 1 than when it 

functions as a nucleus verb in another bracket • 1 (in most instances 

m 

with change of tone). Thus [* • ’ he re, hitherward] as directional, but 

[come * • • ] as nucleus verb (lai^); [there, away] as directional, but [go • • * ] 

as nucleus verb (5'u*^); [• * * down ] as directional, but [descend • • • ] as verb 

(?ia); [* * • as directional, but [ascend * * • ] as verb (§a^ ); and [• • • across 

as directional, but [pass by, pass across • * • ] as verb (kuo). The directional 

may be followed by a suffix (as the perfective suffix in the following examples) 

1 3 0 

Compare he ran (t'a^ p'au le ) with 
He ran here . 



t'a^ 


1 3 
p au 


lai° 


0 

le 


he 


run 


hither 


perfective 


[M] 


[M' 


m 

M 


m] 



Sequences of two directionals after active verbs also occur marking both 
( or down or across ) , and either hitherward or away (from speaker ).,. Com- 
pare lie £as£^ t'a^ kuo le^ [M] [^ m] — with 

He ran across that way. 



(40) 


fal 


1 3 

p'au 


kuo 




1 0 
le 




he 


run 


across 


away 


perfective 




[M] 


[M 


m 

M 


m 

M 


m] 
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Compare eaten it (t'a if'i le ) with sentence (41) which might 

be utter ed when formerly visible food had been all eaten, so that no food 
remained to be seen. 

He has eaten it up completely. 



(41) 



1 



L'd, 



V ..1 

c’* 



Sia° 



I **® 

c u 



le 



0 



he 

fM] 



down away perfective 

r ^ m 

fM - — m] 



M 



M 



Sentences (42), (43), (44), which follow, exemplify the triple referents 
or meanings of two morphemes which are members of a single compound, 

(^'i lai) — one referent when the compound functions as the verb nucleus 
(42), and two different referents when the compound functions as a verb com- 
plement, depending on whether the verb complement follows an active nucleus 
verb (43), or a stative nucleus verb (44). It means ug after active verb, but 
beg^, £ tart after stative verb, when it is called a 'resultative complement' 
rather than a 'directional complement'. As a directional complement, the 
compound is glossed literally as (upward hitherward) . As a resultative 
complement, it is glossed (inceptive resultative ) . As the verb nucleus it- 
self, the same morphemes in the compound are glossed (rise come). The 
examples which follow, let us repeat, are instances of a single compound with 
three different meanings in three different environments; this single compound 

is not to be confused with sequences of two directionals which are not members 
of a compound (40, 41) • 
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He arose (got up from bed). 



1 


^ 3 


2 


le^ 




(42) t'a 


J«'i 


lai 




he 


(rise 


come) 


perfective 




Cm3 

He has stood 


[(M. 

m 

liE 


M ) 

m 


m 3 




(43) t'a^ 


J?an^ 


v,.o 

c'l'' 


Ui® 


0 

le 


he 


stand 


( upward hitherward) perfective 


Cm] Cm ( 

M 

He has started to laugh 


m ) 

M 


mj 


1 

(44) t'a 


liau^ 




lai" 




he 


laugh 


(inceptive 


resultative) 


perfective 


Cm] 


[m 


(iE 

M 


iE) 

M 


m] 



When the subject of a sentence is included in the topic, the observable 

movement of the subject is minimized by stative verb comment (marking to 

laugh, to sing, to sleep, etc, ) but not by active verb comment (marking to run, 

to eat , to swallow, to walk, etc) . Some stative verbs are incompatible with 

topic functioning as subject. And they occur generally in comment sentences, 

4 ^ 

without any prior topic at all. The compound verb to rain (sia u ) is an 

example of such a stative verb. Accordingly, the compound verb complement 

()$'i lai^) is glossed ( inceptive resultative) — rather than ^ (upward hither- 

4 3 

ward)-- when combined with the compound verb (^ia ii ), meaning it started 



to rain (rather than * it rained up here ). In general, the happenings of natural 
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phenomena are specified by stative verbs which are incompatible with prior 
topic functioning as subject. 



verb compounds appear not only with the constituent morphemes 
in a juxtaposed sequence, as (§ia^ u^) it's raining , but also in a discon- 

tinuous sequence, with the first member of the compound (gia'^) — separated 
from the second member of the compound — - (u^) by an interspersed or in- 
fixed morpheme (as perfective le®); le® could also be added as the final 
morpheme of sentences (45) and (46), just as it could have been inserted 
after in sentence (41). 

It has rained. 



4 

sia 


le° 




descends 


perfective 


rain 


[(M) . 


m 


-(M)] 



In general, two member verb compounds of the kind glossed ( sleep - nap ) and 
(sing ji^ong ) are separaTale in the same way. Compare both [he ] ( sleep - nap ) 
sleeps (t a guei giau ), and [he ] [( sing song )] for he sings 
(t'a^ keM, with sentence (46); the perfective le® may also be 



infixed between sing (Cag'^)- and song -(keS . as another example of discon- 



tinuous compound. 
He slept. 



t'a^ 


Suei^ 


le° 


4 

giau 


he 


sleep 


perfective 


nap 


[M] 


[(M)- 


m 


-(M)] 
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0 0 

The compound verb complement, lai ), appears as a continuous 

compound after stative verb (38); it is also separable, but in a new way. What 



0 



_. 0 . . 



intervenes between (c'i ) = and = (lai ) is the second member of a compound 
verb. Then the first member of the compound verb precedes the first member 
of the compound verb complement in an interlocked or interwined sequence, 

[( )- -( ) = (lai°)j: 



It*s beginning to rain that* s the situation, 

0 



(47) sia"^ ^'i 



..3 

u 



, .0 

lai 



le 



0 



descend inceptive rain resultative that's it 
[(M)- 



(-) = -(M) 

M 



=(--) ] 
M 



m 



0 



In this sentence, the final particle suffix, le , is out of bracket and yet, 
though merely background, a necessary 'ground* for the bracket 'figures' 

in the foreground, (see above). Within the bracket, the compound nucleus 

. ^ . 0 
v3rb (^ia u ) combines with the compound verb complement (c'i 

0 

lai ) in a dis continuously intertwined fashion —as if two double -pronged 
gestalt figures, I i and i ( , were integrated in one interlocking figure^ 

rnxj , 



Nucleus verb compounds, like that for (descend rain) above, combine 
with verb complement compounds only by integration — by rxi-J ii^terlocking. 
Nucleus verb compounds like those for ( sing song) and ( sleep nap) may be 
reduced to their first members before the compound verb complement, as in 
sentence (48), or they may remain compounds and integrate with the com- 
pound verb complement by rxu inter locking, as in sentence (49). 



87 



Sino-Tibetan Fascicle One 



He started to sing. 





1 


4 


^ 0 


1 -0 
lai 


0 


(48) 


t'a 


c’an 


c'l 


le 




he 


sing 


inceptive 


resultative 


perfective 




[m] 


[m 


^ m 
( M 


J3L V 
M ) 


m] 




1 


4 


^ 0 


1 


0 0 


(49) 


t'a^ 


c'a^ 


?'i 


ke 


lai le 




he 


sing 


inceptive 


song 


resultative perfective 






[(M) - 


(2S) = 


-(M) 





Some compounds are potentially separably, in the sense that certain 
other morphemes in the comment may intervene between the memVjers of the 
compound thereby making the compound a discontinuous one (see above). 

Other compounds are characteristically discontinuous — in other words, dis- 
continuous whenever possible , Thus, the compound verb complement for 
f^out hitherward occurs as a continuous compound only in short comments, as 
in the following sentences: (50) and (51). When the comment is longer, by 

virtue of including the object of a transitive verb, this object appears either 
as a short inserted bracket (sentence (52)), or as a long inserted bracket 
(sentence (53)), between the first member of the directional compound and the 
second member. The latter is not really a separate phrase, but rather a 
discontinuous appendage of the first phrase in bracket b. In the following sen- 
tences, the comment follows either topic as subject — (50), (52), (53); or 
topic as object — (51). 

He walked out (toward speaker ). 



88 



Anthropological Linguistics, Vol. 6, No. 3 



(50) ta^ 


3 

cou 


A'. ° 

c *u 


lai° 


he 


walk 


out 


hitherward 


Cm] 


[m 


^ m 


“)] 






m" 


M 



Put the money 


out here 


(with pause 


2 

after c'ien ), 






(51) c'ien^ 


2 

na 


s., 0 

c'u 


0 

lai 






money 


take 


out 


hi t herward 




[m] 


[m 


m 

^■m 


m T 

m) 






He put the money out 


(toward speaker). 






1 


2 


0 


2 


0 




(52) t'a 


na 


<^'u 


c'ien 


lai 




he 


take 


out 


monev 


hitherward 




ChI, 

cL 


Cm 




Cm] 

C 






He took out a book. 










1 


2 


0 


0 


2 1 


0 


(53) t'a 


na 


^•u 


• 

1 


pen §u 


lai 


he 


take 


out 


one classifier book 


hitherward 


M 

oi 


Cm 




P m 

L M 

I 


1 


b 


Compare sentence (52), above, and sentences like (54) and (55) where — 


though the subject is not 


specified — the verb is 


Still active rather than passive. 



As for the rice 


(pause), it has been eaten 


up. 


4 


1 2 


0 


(54) fan 


uan 


le 


rice 


eat finish 


perfective 


CKI 

a 


Cm ^ 


m J 

b 



As for the bowl (pause), it got broken. 
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3 


3 


4 


0 


(55) uan 


ta 


p'uo 


le 


bov/1 


beat 


broken 


perfective 


CMj 

a 


[m 


m 

M 


— T 

m j 

b 



In the free translation, added for smoothness. The actor is simply not 

specified in sentences (54) and (55)— and these are clearly not passive sentences; 

but sentences in which topic, a, functions as object and comment, b, includes a 

transitive nucleus verb, followed by verb complement and perfective suffix. 

Sentence (56) shows a passive construction in 'acquired Mandarin' 

4 

with pei before a transitive verb nucleus. Compare a possible answer to 

.4 1 

the question By whom was he beaten? — pei t'a by him . An active, rather 



than passive, causative construction is expressed in sentence, (57) with pa 
before a transitive verb nucleus. 



0 



The dog has been beaten by him . 



0 



(56) kou” 
dog 



pai t'a ta le 

by, be caused he beat perfective 



[Mj Ct 



M 



M 



m 



] 



(As a major morpheme in compound, pei is glossed to be covered , to cover . ) 
He has broken the bowL 
(57) 



1 


0 


2 


3 


4 


0 


t'a 


pa 


uan 


ta 


puo 


le 


he 


by handling 
m 


bowl 


beat 


broken 

m 


perfective 


[m] 


f M 


M ] 


[ M 


M 


m ] 



These two sentences differ in the function of the topic — with topic as subject 



(57); and with topic as object of the verb ta beat (56), 
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When the nucleus verb is preceded by one verb in the domain of travel 
(as walk, run , go , come ), the relation of the prior verb to the nucleus verb 
is simply purposive (in order to). 

Pm going i n order to see him, 

/KQ\ ^ V, ..0 1,1 ^ ^ 

(58) uo c* u k'an t'a 

1_ ^ see hx m 

CmJ [m mJ |M} 

4 

Compare huei can , know how to, followed by verb, as in I knov/ how to 
sing [uo^ Oiuei^ (if*ag ke )] . 

The comment may consist of a series of bra^^kets, each with a nucleus 
verb, which may be flanked by another verb dependent on the nucleus verb 
(a«id/or by prior adverbia l, and/or by following verb complement). The 
different brackets in a successive series of brackets are coordinately related, 
and hence the brackets in itemization series are maximally reorderable. 
Every day he reads and eats and sleeps. 



1 

(59) t'a 


2 

mei 


0 

t* ien 


2 

tu 


1 

^u 


1 

C*4 


4 y. 4 

ian suei ciau 


he 


every 


day 


read 


book 


eat 


rice sleep nap 


[m] 

a 


[m 




0M 


M)] 

c 


[(M 


mO Hm - M)] 

“ e 



Sentence (59) io ^eoi’derable to Ever^ ^^ day he reads a. id sleeps and eats; and 
to Every day he sleeps and reads and eats; and to Every day he eats and sleeps 
and reads ; and so on. 

In contrast to th*i brackets in successive series, above, brackets b 
and c in sentence (60), belov/, are not coordinate because the nucleus verb in 
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each is differently flanked. In c, the nucleus verb is flanked by prior purposive 
verb^^ ^rder to; in b, the nucleus verb is flanked by following verb com- 
plement audits object. The free translation telescopes run and its complement, 

which can only have a place as object, and the purposive go into 
two words, went to, 

I went to his house to see you. 



2 

uo 


3 

p* au 


.0 ,1 
tau t * a 


Sia ^ 


4 

cu 


k*an ni" 


I_ 


run 


reaching his 


house 


JO 


see you 


[m] 

a 


[m 


m 

“m m 


M] 

b 


[m 


m] [m] 

C 



A short bracket encloses one or two morphemes or parentheses 
(compounds and/or reduplications). A long bracket encloses three or more 
such units. A long bracket with verb nucleus is naturally res'ericted to about 
three or four such units, because longer stretches than this would be se- 
gregated into successive brackets, each with its own verb nucleus, as in 
sentences (59) and (60), above. 

Brackets with noun nucleus may be somewhat longer, in their max- 
imum, than brackets with verb nucleus. This is because it is possible to 
expand a demonstrative (or numeral) -classifier- noun bracket, by the insertion 
of a single adjective (or an adjective sequence) between the classifier and the 

noun nucleus*^ as ^ between two - classifier and dog_for dogs (lia^'"^ 

4 3 

Cl ta kou ). When a longer adjective sequence is inserted, all morphemes 

2 

preceding are in a modifier relationship to the modified nucleus noun (ren 



m^, in the following sentence). As many as a half dozen morphemes, more 
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or less, may precede the nucleus noun — five single morphemes and one com 
pound in 



Three men wearing black clothes are dining. 



1 

(61) san 


ke° 


c'uan 


three classifier 


wear 


[m 




m 


M 


2 

ren 


.4 

cai 


v,.l 

C*4 


fan^ 


man 


at 


eat 


rice 


mJ 




(M 


M)] 



^ } 0 0 
hex i fu te 

black (clothes clothes) 's 

M (M- M) m 
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The preceding sentences lend themselves to a smnmary of text frequency, 

with the count based on the 6l sentences as cited above. Thus, the nmiiber 

of occurences of morphemes in the 6l sentences is 333 ^ where a given morpheme 
/ 1 \ 

(as ta he; is counted separately every time it occurs, rather than once for 

each morpheme (as might be done in an inventory count). 

Each morpheme in the 6l sentences falls into one of three classes (M, m, 

M m 

or m alternating with M). 

Most of the morphemes in these sentences - ^8 percent - are always major 

moiphemes (M’s)j and here the text frequency approximates an inventory count. 

(it seems likely that more than half of the total lexical resources of ’acquired 

Mandarin’ would function invariably as major morphemes.) 

Surprisingly hi^ in text frequency are the affix-like minor morphemes 

(m’s). Almost one fifth of the 333 moiphemes fimction as such; IT percent 

are always minor morphemes. But in inventory count, there are only about a 

dozen minor morphemes, which occ\ar again and again in the different sentences 

- for example le® appears 23 times, te° appears 12 times, and so on to ie°, 

pa°, and ma° which occur once each in text frequency. In other texts than 

our sample of 6l sentences, the latter might show a much hi^er text frequency. 

In general something less than one quarter of the morphemes in any text mi^t 

function Invariably as minor morphemes (m’s); and as noted above, something 

more than one half function invariably as major morphemes (M’s). 

That leaves one quarter of the 333 morphemes which function as minor 
m M 

moiphemes (M) in some environments and as major morphemes (m) in others. 

Exactly 82 out of the 333 * ^5 percent of all morphemes in text frequency - 

fall in this alternable M m class. Accordingly, about hedf of the 333 moiphemes 

£ ^ 

(the minor morphemes, m, and the alternable M ** m classes), are as relevant 
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to ’morphology* as to ’sjmtax^* 

At the next higher level of gramoiar^ coopouziids qxuSl/ot reduplleatloz^ '-r.!' 
enclosed in parentheses. Such x>arentheses are found In more than hcQ^f 'f the 
sentences - in 33 the ^ sentences. Ih seme sentences however^ a single 
morpheme or tvo Is flanked hy compounds and reduplications; thus, the sequence 
for sentence 13 is (reduplicated classifier )**(cc(i!pound noun)«single morpheine 
glossed a21-( compound pi*edicate adjective)”. 

^e member of a compound is frequently bound « i.e. found only in conpound' 
but both members of a compound may be free (oecuring either in compounds cr as 
singj.e morphemes)® I«8s commonly, both members of a compound are bound morphemes. 
Repeated members of a reduplication are more often free than bound. Of the 33 
sentences showing reduplication and/or compounding, scarcely more than half a 
dozen parentheses enclose a bound morpheme. Here text frequency shows that les 

than 20 percent of the compounds include a bound nemher. What would an inventory 
count of a dictionary show? 

The parenthesis span, which includes two or more morphemes, whether bound 
of free, functions as one ’part of speech* Just as each single ma,Jor morpheme 
out of parenthesis functions as one ’part of speech*. In the functional view 
then, the parenthesis span and the si n gl e major morpheme are at the same 
word-like level (since each can be classified as ’noun* or ’verb* or ’adjective* 
or adverb' - and in addition some single major ^morphemes may be classified as 
’pronoun* part of speech). What makes laiis word-like level difficult to 
rjaintaln is that at a still hi g he r level in the grammar, the phrase bracket is 
also word-like in terms of major ’parts of speech* (since the bracket phrases 

distinguish the same parts of speech that, single major morphemes distinguish, 
including ’pronoun*). 
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At the intrabracket levol, relative order of single morphemes and parenthesis 

spans is very fixed or restricted, not \nJLike 'morphology' in languages with a 

'word* level. It is at the hi^er interbracket level that contrastive and non- 

contrastive reorderings of 'syntax' are found, while 'morphology' is a matter 

of the fixed relative order at the intrabracket level. But 'morphology' and 

M 

'syntax' are telescoped into one whenever a single morpheme (M or m) is the 
only member of a phrase bracket, as happens in 69 instances out of a total of 
225 phrase brackets in the 6l sentences. Well over half of the phrase brackets 
- altogether I56 - include two or more morphemes. And this means, in short, 
that most phrases in the 6l sentences involve problems in 'morphology' , while 
all phrase brackets are involved in 'syntax'. In the I56 phrases that include 
two or more morphemes, a single morpheme or parenthesis ( compound and/or 
reduplication) serves as the nucleus of the phrase bracket, and the other 
morphemes in the phrase are in a fixed order relative to this nucleus. 

In short phrase brackets, a single morpheme precedes or follows the 
nucleus. About 10^ of the I56 phrases with two or more morphemes show minor 
morphemes (m's) preceding the verb nucleus (sentences 3> 6, 15, I6, 22, 23, 

2 J, 28, 29, 31> 32, 33> 3^> 38). But major morpheme (m) is also found to 
precede, as modifier, the phrase nucleus major morpherae(M) - in a couple of 
instances (sentences 23, 25 )• Some 6 ^ of the same 156 phrases show minor 
morphemes following the phrase nucleus (sentences 1, 2, 3,4, 7, I8, 21, 24, 25, 

30, 42). 

Over 805^ of the I56 phrases with two or more morphemes are long phrase 
brackets in which one morpheme precedes while another morpheme follows the 
nucleus morpheme; or a pair of morphemes precedes (or follows) while a single 
morpheme follows (or precedes) the phrase nucleus; and more than two morphemes 
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miy fXaaak the phrase nucleus (as In sentences 26, 37, 38, 4o, k3, 44)* 

Scae phrase hraclkets in sentences vhlch are uttered only in nacro-s^tax 
(as U, 12, 21) api>ear vithout major morpheme, as a sequence of tvro minor 
morphemes^ including classifier* Ilhen tvo or more xsajor morphemes appear in 
the sane phrase, the nucleus morpheme is that morpheme vhose *part of speech' 

_ class coincides vith the phrase *part of S3peech'; thus, in a noun phrase the 
modified noun is phrase nucleus after the modifier, and in transitiTe verb 
phrase vith non-permutable object noun, the verb is the nucleus morpheme 
before the object noun (as in sentences 17, 32, 33 )• IR^n tvo major morphemes 
in the one phrase are both 'verb* part of speech, the second . 'verb* functions 
as phrase nucleus vhen the first is in the domain of travel (jgo or come in 
Older to )* vhile the first is phrase nucleus in the domain of a]^rais8l (^ 
lihe, tojmnt, to irlih» etc*, as In sentences 5d and 59)0 

final particle suffixes out of bracket (ss in •sentences 22, 23) oxe merely 

% 

background, but may be an Indespenslble 'ground* for the complex 'figures' in 
some foreground brackets (as in sentence 4 t)* Most brackets, hoirever, vhether 
topic and coBHKnt, or ooeinent alone, are self-sufficient In the sense that the 
background particle suffix is optional. 

Siro^thlrds of the 6l sentences in the saaple have topic functioning ae 
actor before comment in vhich only one bracket functions as terb phrase (including 
predicate adjective functioning as verb in sentences 6 to 13, and 20). A lialf 
dozen additional sentences have eonuents vhich include tvo brackets, each 
functioning as verb phrase. 

Xn the re m a ining third of the Si sentences, the topic may function as 
object batof eomnent (vith eiftitdided subject) or the topic inelxdes an embedded 
verb before oomnnent vith aln^pn verb, or oommant appears vithout prior topic. 
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again either \rith one or t\ro phrase brackets. 

With two verb phrases in the comment, the second may negate the first 
(as in sentence 3I: [go]a [ not go ]-}^) in order to elicit an answer (eitner 
bracket a or bracket b). Or the second verb bracket (b prime) may give an 
alternative choice to the first verb phrase bracket (as in sentence 32), in 
oirder to elicit an answer (either bracket b or b prime). With three verb 

phrases in comment (as in sentence 33), the first verb bracket (b) is a transitive 

verb with the paired verb phrases follovring ser.ving as alternative goals (c 
and c prime) to elicit an answer (b and c) or (b and c prime); or (as in 
sentence 39 ) » the 'bhree or mo>^ verb phrases may be coordinate in a simple 
series, without offering interrogative alternatives. 

The preceding 6l sentences constitute one possible sampje of *acq.uired 

I'fe.ndarin' which serves as a profile graiiimar that is isomorphic with grammars of 

all Han Chinese (but not q,uite isomorphic with Wenyen). Some differences among 
the various Han Chinese languages and dialects are noted below. That is, the 
profile of our 'acquired Mandarin' grammar, thou^ isomorphic with other Chinese 
languages, is not identical with them. The differences are found mainly in 
phonology, but are also found in grammar. 
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Aside from dlfferenoes In phonemio imentarteSf iihloh are 
given in a foULcving fasolole^ the chief diversity among the Hon Chinese 
languages and dialects Is found ii differences in the shapes of moirpheines* 

Some of these (as for ten ; Mandarin hut Cantonese s&p) though different^ 
are still cognate that is^ derived from a single parental morpheme* Other 
morphemes are not only different in shape hut also different in etynon^ and 
yet mark the sane referent; thus^ uho is marked hy one morpheme (Sei^) in 
Mandarin b\xb in Cantonese hy another mocrpheme^ or rather hy two morpheinss 
(pjGr^ k5 literally side one )* In the case of the referent for clothing* it 
is Mandarin vhich commonly uses tvo morphemes in a compound (i^ fi^]^ uhile 
Cantonese ooiononly uses one (s4*m clothing )* A single hut different mor- 
pheme marks walk in Mandarin (2ou?) and Cantonese (h^*}])* 

Differences in structure are often asAoclated vlth different mor- 
phemes that different Ean Chinese languages select^ as iihen a given morpheme 
(or succession of moridiemes^ as mei^ iou^ in Ifendarln for not have ) entails a 
structural restriction (avoidance of mei^ in sentence final) not found in 
another language, as Cantonese vhere a single morpheme appears both in sentence 
final and in other parts of the sentence for not have (io^Su)* Bat attention 
can be focussed on occasional structural differences hy comparing the behavior 
of glosses in the 6l sentences cited for ‘aoq,uired Mandarin’, vlth different 
behavior of glosses in other Han Chinese languages* ^y the same device, the 
general sameness in structure among all modem Chinese languages and dialects 
can he reaffirmed in specific conteacts* 

In Peking ^hndarin, the first of the 6l sentences is revised hy tlie 
addition of one morpheme (pa^), i^oh changes the s-tan^Bture of the sentence 



vithoat changing the massage (I have written this letter in a slan-dash manner )* 
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Compare sentence (X), aborve, and the Beking Mandarin revision: 

(l) uo3 wa? rceP hu^ hnP te^ pa^ iW* 

Ll\ Ofhorse i: horse ) (tiger ->• tiger ) [take this claseifier letter! ^ 

C urlte ■perfectivel ;^ 

Tiro pibrase braclcets (a and b) are Identical in the 'aoqidred Mardarin' version 
and in the Peking Mandarin version. Phrase bracket c, IxmoedlateXy above^ adds 
pa3 take to what is the last bracket in the original version* The phraae bracket 
[write perfective l is last in the Bekixig revision (d)^ but pre —final (c) in 
the original version. The revised version alone is acceptable in Peidng Mandarin; 
both versions are equally acceptable to our speaker of •acquired Mandarin* • 

Ih addition to the slight difference as to what is included in brackets the 
sll^jht shift in order of [ write perfectivel * the two versions differ importantly 
in structure* The pbrase brackets of the original version can be reordered 
flrom abed (as cited) to d a b c; whichever order is followed^ an additional 
Iracket phrase (e*g* [^ table upl * meaning oa the table ) may be Inserted 
between phrases b and c^ with esipanded message* In the Peking version^ the 
reordering is from a b c d to a c b d— that is, the adverbial jdarase^ [ ( horse 
ho^) ( tiger » tiger ) -ly], may occur between the two I'erb phrases (c and d), 
or precede both of tiiem* But in either order of the Beldng structure, it 
would seem la m atural to esgsand the sentence b'/ a phrase localizing the writing 
on the table )* though the sentence migjht be eaipanded by other phrases, as 
[ yesterday! between topic and first bracket of oomment* 

The half dozen bracket phrases of sentence (3) ^ He has 3 aever played 
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togplly at home vlth hie own ^Hdren — seem to he eqjually reorderahle (ajod 
t^TuiIlAtahle) in all modem Chinese languages* A possible alternative selection 
of moTj^emes for hraohet b from [ not ever] (pu?* c*e:Q^) to [ not have] (mel^ 
lou9) vould mnd.er the sentence more closely tranelatahle Into Vu and Cantonese 
Xhe stmctcore of sentence (4) hoOLds for San Chinese In general^ hat 
wl*to selection of morpheine for letter showing a different etymon In S* JBLn, 
for eatan^ classifier for letter remaining as a cognate of feq^. 

ed i tion feQ functions as a verh ( to seal )*-^and enters Into a cooipound for 
envelope (letter sealer hut for envelope (?ln^ the compound Is 

preceded hy another classifier than fen^* 

(5) You give that man a hand shows redx^leatlon of the verh In 
Mandarin and Vu (help help] , hut not In N* Min and Cantonese where the phrase 
would ^ ^ossed [ help a stroke] or more briefly [ help strolae] before, bracket c 
Cthat classifier man! • She latter structure Is also possible in Mandarin^ as 
.QssJ foolp one helpt t or, with object, [you] Bielp one time] fthat c^issslfler 
man] • 

(O He*s very thin* The topic Che] Is followed by comnent consisting 
of ^Jectival verb preceded by an Intenslfler, fvery thii() , as a self-sufficient 
sentence* ^ Cantonese the morpheme for good serves as the Intenslfler, as In 
the sense of good and thin* Ohough the Intens^ler (hen^) is distinguished 
^om good (hsu^ ) In Mandarin, the latter may also serve as an Intezislf ler: 
t*a^ hau^ hual^ for He*s very bod * 

[ha] [good badJ 

As occasional Intenslfler, hau^ may appear In non-redi^lcatlve sequence, 

(very;) and then as an adjectival verb ( good )* Jn Vu, the 
Intenslfler Is marked by a two syllable adverb which is more emphatic than 
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Msjidarin for veiy (hen )• And the Wu bisyllabic morpheme may be final , or 

f 

precede a final particle >7ithout intervening verb. In .Mandarin, the occasional 
ihtenalfier (hau^) — but not the usual one (hen^) — may function as an adjectival 
verb immediately after the topic in a response sentence^ For example, the 
answer to *Who*s good?!' might be [t*a^] jhau^J he*s good ; rather than he*s 
yeiy good (6, above), 

He*s rather thin ( poor thing ) , with commiseration marked by 
reduplication — ( thin + thin ) , In Peking and in other national capital cities 
(at? Hanking), the non-syllable suffix -r (heard only in capital cities) appears 
where no suffix is found, as completing the reduplication ( thin ■I' thih-r ). or 
as an alternate selection for the syllabic diminuitlve -Cst, In Cantonese and 
in Wu, the morphemes (suffixes or enclitics) that appear after red\Q)licated 
forms are apt to be different than those that appear after the non- reduplicated 
fom of the moipheme. Compare non-reduplicatijd fom in sentence (6) with re- 
dui^licated form in ( 7 ), What such reduplication marks in Han Cliinese generally 
is a Hense of liveliness, as when one says f hej [fat | fat^ ly to not 

onJy that Hejs fat, but that there is something lively and cute aboait his 
being fat like a baby, 

eat this to see ( how it is ), with the first phrase reduplicated 
T and enclosed in brackets before the second phrase |_look3 ^ because 
object, when specified, appears between the transitive verb and the intransitive. 
This is a very productive pattern that appears in Han Chinese generally. Wu 
selection is much like Mandarin for Look and see - literally, loo k one look 
look, or more briefly [( look | look )] flook] for Take a look to see (how it 
is^ — k*an k*an^ k*an^ • Many reduplicated morphemes function as verb before 
j object and before intransitive, then see how it la (k*on^), with the object 



( 
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giving a semantic guidiS to the point in referent range of the transitive veil? 

(as ta for beat, use instroment play) #; Thus, H it Mm and see (what the 
consequences are as crying, running away, fighting back): f( beat | beat) ] 

fhliD] Clood • If the object is telephoned t;fee translation is Ifake a telephoae 
call and see: if mahjong, it is Play mah^ng and see. 

Sentences including classifiers — sentences (lO) to J13) are found ia 
CShlnese generally, And classifiers exbend beyond Sino-Tibetan to unrelated 
SB Asian languages. But such sentence profiles with classifiers differ from, 
these encountered in Han CMnese, 

(10) This card is pretty shows a profile In which deanonstrative ( this, 
substitutable by a numeral) preceeds classifier before noun ( card, or adjective 
and noun, as big card ). For this sentence Cantonese would select another 
etymon for the adjectival verb ( bri^ t ia the sense of pretty or attractive), 
without changing the classifier sentence profile, 

(11) HMs one is pretty shews a profile in which demonstrative ( this, or 
else a numeral) preceeds classififer (substitutable by another classifier for 
This kind is pretty) , 

(12) Ev ery one of them Is pretty shows a profile in which the reduplic^tsd 
classifier, serving as topic R classifier cleussifier)] , is followed by conanent 
[all pretty] , which specifically echoes every oxie by all; this is higlOy pro- 
ductive, but not obligatory for a-tJ- sentences, 

(13) All the cards are pretty shows a profile in which the redtrplicated 
classifier preceeds noun as topi©, wltL ,<ixorpheme for ^ 33 . again included in 
comment, 

(1 4 ) E^s Is my world shows a profile possible in ttadarin (demonstrative 
alone as topic without following classifier or noun), which does not appear in 
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Han Chinese generally — a*b least not in Mln^ Wu or Cantonesa where demon- 
t?tratives (for thls« that> whie'iy etc. ) would always be followed by a classifier, 
as in the profile shown in sentence (ll), above* 

(^ 5 ) I don*t have; (l6) I don*t have moneys (17) I haven* t aav money * 

In each of these three sentences, a negative morpheine (mei^) occurs before 
Kox'pheme for have (with and without object) or before the object (without int^r- 
i>ening have but the ne^tive (mei ) never occurs in sentence final in Ifendari: « 
In Min, Wu and Cantonese a single morpheme functions lilce the Mandarin nob 
before have (iou^); hence the two Mandarin sentences, (16) and (l?), mi^t ba 
said to merge into one sentence in these non-Mandarin language, and this siagle 
morpheme (e.g. m'Su in Cantonese) may appear as the last morpheme of a sentence, 
parallel to I__don*t have above). Beside such verbal negatives, Ban Ghinea*^ 

•1 

languages also mark negative by a non-verbal morpheme (pu nob in Mandarin)* 
Both the latter and the verbal negative (as auxiliary) appear in sentences befere; 
verbs and adjectival verbs* In the sentence [I] [negative go] , the meaiiing 
-I have not gone with verbal ne^tlve, but I*m not going* sometimes in the senrvs? 
not willing to go with non-verbal negative c In the sentence He negative goc% 

O 

vrith adjectival verb (hau ), the meaning is He*s no good with non-verbal negati/>.’,, 

but He hflsn*t recovered with verbal negative* He*s not bad (pu^)vs. It basnet 

2 

spoiled* become bad (mei )* 

(18) He runs q.\iick 3 .y* The adjective far quick (k*ual^) is transformed 
into an adverb by reduplication plus suffix in Ban Chinese in general* A 
©ingle suffix (te^) follows the (quickj * quic k) in our sample of ’acquired 
i>fendarin*, but in Peldjig Mandarin a non-syllable suffix (-r) is included with 
each member of the reduplication. 



(19) He is running CLuickly on the street * Between the redtplicated ad- 
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verblal phrase for qulclLly (as for 18^ above) and the final verbal phrase 
there is a phrase bracket beginning with a co-verb is at (cai^) foU.owed by a 
place esqpression which includes a comj)ound for street — ( horse road) in 

O 

popular etjniiology, because ma*^ out of compoiand means horse; but in 

3 

historical fact^ ma is a recent borrowing (the first syllable of English *mac- 
adamlzed* )• When not followed by place egression, cal^ appears in such euphemis- 
tic sentences as He*s still living (jt*a^ fcai^jl )^ as in answer to a question 
(whether one’s father is still living); compare He’s not in [pvP cai^J ) 

in response to as k i n g whether someone is in his office. In some languages this 
moxpheme alone (or alternating with cen*^ cai^^ in ’acquired Mandarin’) precedes 
verbs of action in the sense of in the pjrocess of^ as He’s ij^iinning ([t’a^ fcai'^ 

3 

p’au ])» Jhterrogatlves^ and affiimation ( can run fast ) or negation of ability 
(can’t run fast ) in Han Chinese agree largedy on the selection of moiphemes (non- 
verbal negative for not or cannot, and te^ or cognate for can ), but differ in 
the order and cooccurrence of these morphemes 4 

(21) Hals kind can be eaten ; topic functioning eis object of verb is more 
commonJy encountered in Cantonese than in Mandarin or Wu. In the latter lan- 
fiuages^ [l^iis kind] as topic (demonstrative before classifier without following 
noun) usually functions as subject* 

(22) Mblguity as to whether topic functions as object or subject is entire- 
ly removed when empty morpheme for pause (ia^) follows the topic as object: ^ 
for chi cken (pause )^ (zei'o but potential embedded subject) is not eating any 
more^* In relative order of subject (S), object (o), and verb (V), pezmutatlons 
that are possible in Han languages include S70 and 0S7, SV and OV* The OV order 
is exemplified in As for chicken (o) not eating any more (with the subject not 
specified^ but with possible anblguily^ since chicken ml.ght be taken as the sub- 
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Jeat). Stated positlreljr, S may precede V (vltbout speelflcatlon of 0); 0 

may precede V (without specification oJ S); OS nay precede V (ir tha- order) j 

0 may follow V (with or without specif; cation of 3). Stated negatively S never 

follows V; S never follows In sentences with two link verbs, V, and V ); 

0 never precedes V^, (in sentences with link verbs where 3 of Yi precedes 7^. 

whose object follows; but this object functions as the si&Jeot of Vg In pivotal 

0/3 sequence S 4-0/s-> Vg); 03 before V Is never Inverted to 30 before V 

(experimental attempts to say 3C before V are understop'A as 08, since 0 and. 3 

are not marked for their function as object or subject except by their relative 
order la respect to V). 

^ Is Just apt to come (can poss bj^^come). Ihe oonsnent begins 

with huel can (which would be followed uy the cognate of ttfi In WU) 

which occurs, as auxlllaiy verb, with and without phrase nucleus verb, la such 
questions as S R , yoV . (notcaaj (huel^ pu° huel^) and Can you do It? 

(3^ I^3jnjoJ (huel^^ cuo^ pu^ huel^ cuoO). o!he tqplc functions as 
subject, ffl^st Is the ta^anslatlon unit of Irihe compound (guest person ), ih 
Mandarin, the first meniber of the compound is free, while for (host perscm) the 
first m^er is bound — l.e. appears only In ccmpound. But In Cantonese the 
cognates for g u est as well as host are bound — found only In compounds. 

eaten noodles (as you can see) . !Uhe perfective suffix after 
bhe verb (le ) has the same shape as the sentence final suffix (le^, out of 
bie,ck*.b) In Mandarin; when both are used, as here, both meanings are marked, 
but arnblgulty is possible when onOy one le^ appears at the end of a sentence 
(as In 25). Such ambiguity Is possible neither in Wu nor In Cantonese, since 
bhe two mozph^nes parallel In function to le are entirely different in shape. 

And In general tijere Is more differentiation of the kind suggested here In the 
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relatively small area of Ohine. In vhlch Va is spoken than In the three-fourbhs 
of CShlna In viilch M a n d ar i n Is spoken In pcurblenlar, there Is lebs dlfferenkl<» 
atlon In person pronouns In Mandarin dialects. 

(25) With le^ at the end of the sentence: If pexfectlve^ the sense Is Iha 

(for le^); If modal or situational sentence suffix^ out of 
bracket, the sense Is ( Oh. look) , a guest Ip (for le^2)* ^ 

tonese, the perfective suffix has one shape, *010 sentence situational suffix 
another, and the functions of the two are thereby kept distinct when either 
occurs at the end of a sentence. 

(26) Xhere appears to be no parallel in Cantonese to cal^ future again. 
vhl<^ occurs only with link verbs (28) or with final situational suffix; this 
usage Is more cconon In Wu than In MandEirin, He won’t ever come again. ( that*s 
the situation) . When the order of the fliozphemes In the comment Is changed from 
those marking future again also not to also not future again, a new sentence Is 
formed: gB_ also l a not eomirt g again f that»s It) . 

(27) 3h Cantonese Cehat one stroiaBl appears la place of the reduplication 

favored by WU and SW Mandarin dialects for such sentences as Let*s chat a little 
more. 

(^3) X-don*t think h e *11 come agedn. This sentence shows link verbs (for 
t hink and come) , the linkage Is effected by a bilateral pivot (a noun or, as here, 
a pronoun which serves ob the object of the first link verb and as the stibject 
of the second). Sentence profiles showing link verbs with bilateral pivots ure 
found In all Han Chinese languages. 

(29) Ask him to come again shows the bilateral pivot, t*a^, functioning 
as object of ask. and subject of come. Strings of such pivots may appear in one 
sentence, as that for I ask you to ask him to come . 
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(30) Topic as danse subject (utterable as seUr-sufflcient sentence) 
before comoent is unlvea:«al for Bon Chinese. 

(31) shows an fX] [not X] Interrogative profile (iTgol 
Inot gol ) in Ktandarln. In Cantonese^ when the verb xsorpheiQe or translation 
unit la bisyllablc^ the first sylleble precedes the laoiphenie for not. 

both follow in an X not XT interrogative profile. 

(38) BO-Vcm want tea or coffee? shows verb (V) and object (o) sequence 
before and after the phrase bracket for oji^s • [Tbal^ %sPj replaceable by 

- so that the interrogative profile Is [VO^ - for] - fvo^ , which 

also appears In Wu and Cantonese. More brlefjy, in Ifendarln, [VOj] [V 0 j 3 ds 
an Interrogative profile which ooilts the phrase bracket for for 3 ; a bilaterally 
oonneotlng phrase Is never ooltted In Wu or in Cantonese which distinguish 
noxphemes for either from laozpheiaes for ££• 

The Interrogative prof lie (72(^3 - CbsJ • (Yg Oa^ siEpe^s after a tran* 
sltlve in sentence (33)5 Po you like to eat :cioe or eat noodlest ahe 
transitive verb for llk^_ is not a co-verb with that for to eat beoatisa the 
subject of ms, is e^qpressed In the topic ( you) , while Its object la eaqpressed 
In the Interrogative profile^ as glvon above. Ohe Mandarin translation unit 
for Is a ooeipound fjoy the cognate morphias for this compound 

may he inverted In Wu ( dellfdit__ Joy) . as the translation unit for -fcft 

(3^} J^he Jum ping dowfi? The verb coiqplenient phrase - fdown away^ may 
he uttered once though at the end of the sentence^ though this j^hrase is equally 
relevant to both altemetlvea In the Interrogative profile fv 3 fhot Vj — at 
least so in Cantonese^ Wu, and 'acquired MEmdarIn*. hat in Peking Ibndarln, 
the verb complements are incorporated In each alternative of the Interrogative 
profile s fv down avay l [not V down awav l. 
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Noun and pronoin (interrogative or personal) function in Heoi Chinese In 
general as subject before the verb and as object after the verb, as In (35) and 

(36). 

(37) Has be Junq?ed down? shows interrogative by final sentence suffix 
after the phrase bracket rimap down away perfective*) , an order shared by Jfen- 
darln and Cantonese, but not by Wu. (so also, for sentence 4o). It is possible 
In 'acquired Mandarin' to have the perfective verb L Jump nerfectlvel precede 
the verb caqplenants fdown etwaarl ; this is the obllgatozy order In \i\i» 

(38) Has he Junped down? shows the Intezrogatlve by a two iBorpheme verbal 
wsgaUvs pbnwe, . ^tb] , after Bffg a«wi wwy perfective! a Ir Cantoziese, 
this Interrogative profile appears with a one mozpheine verbal negative; when 
this negative phrase is followed by an interrogative particle, out of the bracket, 
the interrogative is redundantly narked* 

(39) The Mandarin etynon for run differs fron that for Cantonese. The 
latter is cognate with that in l^nyen for run which also appears in Mandarin 
but then shifts in meaning to walk. 

(40) See (37). 

(41) See (37), for HU ead even Cantonese preference for having the equiva- 
lent of the perfective le^ after the verb, and before the verb conpleiiients 

(instead of the Man^rln order of reat down away perfective! , or with quantified 
object — as for an apple, or a bowl of rice — after the peifective verb* 

OSie preferred WU-Cantonese order (perfective after verb) for he ate and tlie pre- 
ferred Mandarin order (perfective after verb cospleiaent or object) may both be 
used for he has eaten. 

(42) le, arose (in the sense of getting up from bed in Mandarin and Wu, 
but of rising body in Cantonese). 
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(43) and (44) chov lal^) as possible 'bransla'tion tuilt for up 
(directional conplement), with verbs of action as stand ( He stood ttp^ . fjy 
( ife Bttt the sa2oe znoiph^nes In phrases with non-action or 

stationaxy verbs, as laugh, thin k, talk, mark another translation unit, In- 
Sgpt^v^ , (resultative coo5>leinent), as in Hh began to thinlc, began to talk, or 
Bh has started to la.nf;h^ Hither translation unit, the diiecti»nfl1 or 
lnp^ptlve-rysul'|;a;fciTg _ is possible when this verb coo5)lement appears in phrases 
after verbs of action: He, has started to flv (beside flown an) , Hence, 
ainblgulty is possible with verbs of action but not with non-active stationary 
verbs. Equivalence of two sequences or translation units for and begin. 
start is widespread among Han Chinese languages. 

Opopare sentence (49)> with the order V- ineentive Q- reoultative for sing 
(V) s o ng (0) — started to sing; for the object (o) the title of a song may 
be given. This Interlocldng order is possible in all dialects, but is not 
equally favored. Compare (50) He walked out (toward sneaker). (51) Put t he 
ff S Poy o ut (with gon^ as topic), (52) He nut the money out (toward 

(53) Hb_ took out a book, in which the dtial directional verb conrpleaent 
is interrupted Ty the objects |ii • . ouVT ) one classifier book *] • 

TXl Wu the dual verb conrolement (out hitherward) is repeated after the object 
&ne classifier book1 , 

(pause), it got broken. Ih Cantonese and Wh, instead 
of the moipheme for broken different fflorphemes with different etyma are selected 
(for S 2 BSL w»aby in Cantonese, and pulverized in Wu). 
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JAF . . 
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SJA . . 
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